Mangle

Statistics: July

A log is kept of the options and results for each random search. I wrote a small visual basic program to analyse everything based on the data from the past 30 days.

General
Number of searches through July 31 7679
Total number of words used in searches 22669
Number of times safemode was used 351
Number of times Frames was used 6821
Number of times Javascript wasn't used 459
Number of times the search failed for an unknown reason 81
Number of times the search failed for exceeding Google's limit of 1000 queries in a day 886
Number of times no URL was found 139
Number of corrupt entries in the log 27
Actual number of usable searches 6546
Number of words in the database 7111


Website URL hits
The root URL was counted for each site visited. For example, If a web site address 'www.news.com/paper/june/00232.html' was hit, only 'www.news.com' was counted.

Web site root URL Hits (out of 6546)
www.geocities.com 50
news.bbc.co.uk 38
wortschatz.uni-leipzig.de ** 38
www.cnn.com 22
members.aol.com 21
citeseer.nj.nec.com 17
english.pravda.ru 16
www.pbs.org 16
www.guardian.co.uk 14
www.nap.edu 14
www.epa.gov 11
www.law.emory.edu 10
news.com.com 10
www.usatoday.com 10
www.usdoj.gov 10
www.angelfire.com 9
www.cdc.gov 9
www.epinions.com 9
www.ibiblio.org 9
www.wired.com 9
Other notables:
members.tripod.com 8
www.bbc.co.uk 7
www.time.com 5
www.ftc.gov 5
dmoz.org 5
www.zdnet.com 3
www.fcc.gov 3
www.canoe.ca 3
www.microsoft.com 2

** Note that the site wortschatz.uni-leipzig.de contains a 10000+ word list of the english language, and so this page is generally found if an odd combination of rarely used words are used in the search.


Top-level domain hits
The top-level domains (.com, .edu, .org, etc) in the web URLs were counted. Not surprisingly .com makes up the majority of hits, with .org and .edu far behind.

Top-level Domain Hits (out of 6546)
.com 2293
.org 913
.edu 545
.uk 298
.net 219
.gov 180
.au 131
.ca 130
.us 96
.de ** 65 (27)
.jp 43
.ru 28
.nl 26
.se 24
.mil 24
.gr 24
.nz 23

** Once again the site wortschatz.uni-leipzig.de is from the .de top-level domain, and therefore artificially inflated the number of times a .de site was hit from 27 to 65.


Domain File Extension Names
The filetypes that were used in the URL were counted. .html and .htm make up more than 70% of all the filetypes encountered. Many sites only had a root URL and no filetype to show.

File Extension Hits (out of 6546)
.html 2745
.htm 2031
Root (no extension) 615
.asp 315
.shtml 179
.cfm 103
.txt 96
.php 64
.stm 40
.ppt 20
.php3 20
.rtf 18
.gz 17
.cgi 14
.jsp 13


Word Frequencies
Although there is an equal probability of any word being picked out of the 7111 words in the database, some words came up more often than others. These are the words that were most often picked. There seems to be a bug where a word is occasionally picked twice in a row ...

Words used 26 times: animated, concedes
Words used 24 times: troy
Words used 23 times: swimming
Words used 22 times: equals
Words used 21 times: damaged, imposing
Words used 20 times: bias
Words used 19 times: drag, intent, outgoing, typing
Words used 18 times: commonly, incentives
Words used 17 times: analyses, municipal, navy, protection, purely, strength, telling, testing
Words used 16 times: (17)
Words used 15 times: (18)
Words used 14 times: (21)
Words used 13 times: (33)
Words used 12 times: (56)
Words used 11 times: (78)
Words used 10 times: (121)
Words used 9 times: (150)
Words used 8 times: (223)
Words used 7 times: (285)
Words used 6 times: (426)
Words used 5 times: (493)
Words used 4 times: (642)
Words used 3 times: (821)
Words used 2 times: (981)
Words used 1 times: (1089)
Words never used: (1635)


Frequency of option 'Number of words'
Number of times 1 word was used in search 264 3.4%
Number of times 2 words were used in search 431 5.6%
Number of times 3 words were used in search (default) 6631 86.3%
Number of times 4 words were used in search 115 1.5%
Number of times 5 words were used in search 238 3.1%


'Country' options used:
The number of times the search was limited to specific countries (the countries which were used less than 10 times were omitted).

Option: Country Number of times used
United States 255
United Kingdom 35
Canada 26
Australia 24
Great Britain 23
Denmark 22
Japan 16
Belgium 12
Netherlands 10


'Language' options used:
The number of times the search was limited to specific languages (the languages which were used less than 10 times were omitted). Once again the language option was hardly used -- Mangle is very good at finding non-english websites when only 1-word searches are used, you should try it!

Option: Language Number of times used
English 379
Japanese 16
Danish 16
Dutch 11
French 10

Other stats:

Other stats:
2005
January
2004
December
November
October
September
August
July
June
May
April
March
February
January
2003
December
November
October
September
August
July
June
May
April
March
February
January
2002
December
September - November
August
July
June
March



Home     Browser Toolbar     Help     Statistics     Search History     Links     Contact