Mangle

Statistics: June

A log is kept of the options and results for each random search. I wrote a small visual basic program to analyse everything based on the data from the past 30 days.

General
Number of searches through June 30 4506
Total number of words used in searches 13004
Number of times safemode was used 42
Number of times Frames was used 4201
Number of times the search failed for an unknown reason 73
Number of words in the database 7111


Website URL hits
The root URL was counted for each site visited. For example, If a web site address 'www.news.com/paper/june/00232.html' was hit, only 'www.news.com' was counted.

Web site root URL Hits (out of 4506)
www.geocities.com 49
news.bbc.co.uk 24
wortschatz.uni-leipzig.de ** 23
www.cnn.com 18
members.aol.com 15
english.pravda.ru 15
www.pbs.org 12
www.guardian.co.uk 11
citeseer.nj.nec.com 11
www.amazon.com 10
www.washingtonpost.com 8
www.usatoday.com 8
www.wired.com 7
www.theatlantic.com 7
www.ftc.gov 7
www.fas.org 7
web.mit.edu 7
www.un.org 6
www.fansonly.com 6
www.angelfire.com 6
seattlepi.nwsource.com 6
abcnews.go.com 6
Other notables:
www.nature.com 5
www.microsoft.com 5
www.epinions.com 5
members.tripod.com 5
www.stanford.edu 4
msdn.microsoft.com 4
www.nist.gov 4
www.greenpeace.org 3
www.undercover-brother.com 2

** Note that the site wortschatz.uni-leipzig.de contains a 10000+ word list of the english language, and so this page is generally found if an odd combination of rarely used words are used in the search.


Top-level domain hits
The top-level domains (.com, .edu, .org, etc) in the web URLs were counted. Not surprisingly .com makes up the majority of hits. Limiting the search to a specific country really doesn't have much bearing on the top-level domain, as I believe the Google regional searches are based on the country of registration, not the domain name.

Top-level Domain Hits (out of 4506)
.com 1654
.org 612
.edu 387
.uk 192
.gov 165
.net 148
.ca 89
.us 62
.au 60
.jp 29
.de ** 25
.mil 22
.nl 17
.fr 13
.dk 13
.se 11
.ie 11
.nz 10

** Once again the site wortschatz.uni-leipzig.de is from the .de top-level domain, and therefore artificially inflated the number of times a .de site was hit from 2 to 25.


Domain File Extension Names
**Added July 31**
The filetypes that were used in the URL were counted. .html and .htm make up more than 70% of all the filetypes encountered. Many sites only had a root URL and no filetype to show.

File Extension Hits (out of 4506)
.html 1803
.htm 1358
Root (no extension) 540
.asp 194
.shtml 131
.cfm 79
.txt 43
.php 35
.stm 26
.ppt 14
.php3 12
.jsp 7
.cgi 6


Word Frequencies
Although there is an equal probability of any word being picked out of the 7111 words in the database, some words came up more often than others. These are the words that were most often picked.
Words used 18 times: calm
Words used 17 times: shake
Words used 16 times: variables
Words used 15 times: mind
Words used 14 times: characterization, conservatives, equals, gainers, turn
Words used 13 times: economical, memories, personally
Words used 12 times: birds, bombing, embassy, essential, grab, level, occurring, proposing, repeated, scroll, sellers
Words used 11 times or less: Too many! :)


Number of times individual words from the database have been used in the searches
Out of 13004 total words used, this is the number of times that the same word was chosen from the database of 7111 words.

Number of times a word was chosen once: 1498 21.1%
Number of times a word was chosen twice: 963 13.5%
Number of times a word was chosen 3 times: 624 8.8%
Number of times a word was chosen 4 times: 484 6.8%
Number of times a word was chosen 5 times: 297 4.2%
Number of times a word was chosen 6 times: 213 3.0%
Number of times a word was chosen 7 times: 150 2.1%
Number of times a word was chosen 8 times: 78 1.1%
Number of times a word was chosen 9 times: 60 0.84%
Number of times a word was chosen 10 times: 29 0.41%
Number of times a word was chosen 11 times: 18 0.25%
Number of times a word was chosen 12 times: 11 0.15%
Number of times a word was chosen 13 times: 3 0.04%
Number of times a word was chosen 14 times: 5 0.07%
Number of times a word was chosen 15 times: 1 0.01%
Number of times a word was chosen 16 times: 1 0.01%
Number of times a word was chosen 17 times: 1 0.01%
Number of times a word was chosen 18 times: 1 0.01%


Frequency of option 'Number of words'
Number of times 1 word was used in search 277 6.1%
Number of times 2 words were used in search 257 5.7%
Number of times 3 words were used in search (default) 3805 84.4%
Number of times 4 words were used in search 37 0.8%
Number of times 5 words were used in search 130 2.9%


'Country' and 'Language' options used:
The language option was hardly used, and when it was, only english was chosen. Mangle is very good at finding non-english websites when only 1-word searches are used, you should try it! The country option was used a little, as shown below.

Option: Country Number of times used
United States 94
United Kingdom 63
Canada 39
Japan 12
Denmark 10
Netherlands 9
Sweden 6
Austria 6
Belgium 6
China 5
France 4
Brazil 3
Spain 3
Czech Republic 3
Germany 2


The number of times that, within a single search using 2-5 words, one word from the database was repeated within that search:
Zero. It is extremely unlikely that this would occur, as the probability of the same word used in a 2-word search is 1 in 51 million, and in a 5-word search, 1 in 5 million.

Other stats:

Other stats:
2005
January
2004
December
November
October
September
August
July
June
May
April
March
February
January
2003
December
November
October
September
August
July
June
May
April
March
February
January
2002
December
September - November
August
July
June
March



Home     Browser Toolbar     Help     Statistics     Search History     Links     Contact