portal.htmbasic.htm → local.htm
 
Updated 18/April/2008
Local
Specialized
Specific
       
search
lore
Introduction
Files and images repositories searching   ~   Searching the 'invisible' web
Local search: valuable pointers   ~   Webrings searching
Other stuff 1   ~   Journals

Introduction
The importance of specifity and of local & specialised searching


The importance of getting to your signal 'behind' the main search engines should be quite evident. You can obviously go regional, visit usenet, use your own bots or many other tools, but you will also find various possibilities to query specific search engines on the web.
Of course each big site may (and probably will) have also some specific 'hyper-local' search engine, for www.searchlores.org we have for instance our amazing "namazu" search engine: [search @ fravia]

So-called "local" (or specialised) search engines are extremely important to fetch info. Freepages providers, counters, stats applications and all other finalized search engines... that you can use at leisure... for fun, study purposes, "strafing", slowbombing and/or knowledge profit. This page is just a proof of concept meant only to give you an idea, a scent, of the many 'local' treasures you can and will dig out on the Web.

"Local" searching: valuable pointers 

"Local" searching:
valuable pointers
    [Hitbox] [Usenet] [Erolinx] [Idexer] [Acrofinder] [OLGA] [AOL]
Russki area tops: [Rambler's top 100] [Spylogs's top] [1000stars's top]

[webrings] [irc searching] [weblog searching] [recall searching]

[Homepages searching] : [Xoom] [Tripod] [Fortunecity] [Geocities] [Crosswind]
See also: LocalPageProviders (huge but slightly obsolete list): [LPP]

Files and images repositories  (by Kane and Fravia+)


§ This section is obsolete. For a more recent treatise of these matters, confer files repositories


Searching files repositories can be VERY useful, as an example try the following (banal) "bookz" search: rapidshare O'Reilly Here a short list of files and images repositories, note also how using the names of these repositories as queries you'll discover many a juicy messageboard...

Here some simple "repositories" webbits:
For pdf files: pdf "rapidshare.de/files" site:rapidshare.de
For movies/video files: +inurl:avi|mpg|wmv site:rapidshare.de
For mp3 files: +inurl:wma|mp3 site:rapidshare.de
For software/zipped files: +inurl:exe|rar|zip site:rapidshare.de
Replace rapidshare.de with megaupload or anyone of the repositories below in order to search for files available on different servers.

Most recent additions (4 June 2006):

http://www.oxyshare.com/: 700 mega per file per 30 days. Relatively fast.

http://www.badongo.net: 1 giga per file forever! ("For now there is no file life-limit and only files violating our Terms & Conditions are deleted")

http://bigupload.com/: 2 Giga! per file forever, but deleted after 30 days if unused

http://www.filefactory.com/: 500Mb per file forever, but deleted after 30 days if unused

http://www.insightz.net/: 500Mb per file forever, but deleted after 30 days if unused

http://en.mylivepage.com/: 1 giga, homepage and files hosting service.

http://www.axifile.com/: 150Mb per file, deleted after 30 days

http://www.filedepartment.com/: (Does not seem very trustworthy): Up to 1000 downloads per file per day for FreeShare file, Up to 5000 downloads per file per day for TotalShare file, Up to 5 files for upload per day, Up to 50 mb for upload per file, 10 seconds of time limitation for FreeShare file, 5 seconds of time limitation for TotalShare file, Auto file deletion after 10 days since last download.

Name Size Limit Restrictions Filetypes Registration Domain Hosted
RapidShare 50Mb Removed after 30 days of no use. ALL No USA & Germany USA
MegaUpload 500Mb Removed after 30 days of no use. ALL No Hong Kong USA
Spread-It 500Mb Removed after 14 days of no use. ALL No USA USA
TurboUpload 70Mb Unknown. Has a dumb anti-piracy messege stating proxies won't hide your IP. Total bullshit ;) ALL No United Kingdom USA
4Shared 10Mb Restricted to 100Mb. ALL Yes Ukraine USA
YouSendIt 1Gb Removed after 7 days or 25 downloads, whichever comes first. ALL No USA USA
MyTempDir 25Mb Removed after 14 days or 1200 downloads, whichever comes first. ALL No USA USA
SendMeFile 30Mb Removed after 14 days. ALL No Czech Republic Czech Republic
UltraShare 30Mb Removed after 30 days of no use. ALL No Canada USA
BestUpload 600Kb Removed after 30 days of no use. ALL No Denmark USA
UpDownloadServer 250Mb Uploaded content manually checked for legality. ALL No Germany Germany
Li.ru 32Mb Removed if downloaded less than 5 times in 24 hours (Thanks Mordred). ALL No Russia Russia
PeruShare 50Mb No idea, I don't speak Peruvian ;) ALL No Peru Peru
Webfile.ru 20Mb Removed after 7 days of no use. ALL No Russia Russia
Zalil.ru 50Mb Unknown ALL No Russia Russia
Spbland.ru 50Mb Downloads restricted to .ru ALL No Russia Russia
SexUploader 500Mb Unknown. ALL No USA USA
UploadTemple 30Mb Removed after 30 days of no use. Multimedia, Zip No USA USA
UploadHut 50Mb Removed after 30 days of no use. ALL No India USA
FileHD 60Mb Unknown, site is down tempoarily. ALL No Netherlands Netherlands
DataPickup 1.5Gb Removed after 7 days or 25 downloads, whichever comes first. ALL No USA USA
FileUpYours Unlimited Unlimited, but warns it isn't for mass file distribution. ALL Yes USA USA
FileFront 1Gb Have to upload within 10 hours. ALL Yes USA USA
TransferBigFiles 1Gb Removed after 5 days or 20 downloads, whichever comes first. ALL No USA USA
FTPZ.us 880Mb Unrestricted, contact is a gmail account. Be wary. ALL No Austria Germany
TheFileHut 250Mb Unrestricted. ALL Yes USA USA
FileCache 125Mb Removed after 30 days of no use. ALL Yes Germany Germany
PushFile Unknown Tempoarily offline due to copyright violations. *lol* What did they expect people to use it for ;) Unknown No Brazil USA
SaveFile 60Mb Removed after 14 days of no use. ALL No Denmark Denmark
FreeUploader 50Mb Unknown. ALL No USA USA
UShareIt 50Mb Unknown. ALL No USA USA
ShareFiles.ru 50Mb Removed after 30 days. ALL No Russia Russia
IllHostIt 40Mb Unknown. ALL No USA USA
FileBuffer 40Mb Unknown. ALL No Croatia USA
35 30Mb Unknown. ALL Yes USA USA
SwiftDesk 30Mb Unknown ALL Yes USA USA
Falcon 30Mb Unknown. ZIP, RAR No USA USA
RipWay 30Mb Unknown. ALL Yes USA  
MyFileStash 25Mb 2000Mb a month data transfer. ALL Yes USA USA
11mbit 20Mb Unknown. ALL No Germany Germany
Uppit Unknown. Removed after 30 days of no use. Images, Movies, Applications. No USA USA
UploadX 3Mb Unknown. ALL No USA USA
FilePost 500Mb Unknown. ALL Yes Canada USA
StorageFun 50Mb Unknown. ALL Yes USA USA
Fdrop 100Mb Unknown. ALL Yes Canada Unknown
BigFiles 10Mb Unknown ALL No USA USA
zUpload 500Mb Unlimited apparently. ALL No USA USA
ShareBigFile 1Gb Removed after 7 days. ALL No South Africa South Africa
DropLoad 100Mb Removed after 7 days or 1 download. Pretty useless really :) ALL Yes USA USA
GigaShare 50Mb Unknown Video No USA Netherlands
UploadSend 50Mb Expires after 30 days of no use. ALL No USA USA
MyFileShack 25Mb Unknown ALL Yes Canada USA
YourFileHost 50Mb Removed after 14 days of no use. ALL No USA USA
FilesUpload 10Mb Unknown ALL Yes USA USA
Come2Store 10Mb Unknown ALL Yes USA USA
WebFileHost 10Mb Unknown ZIP, RAR No USA USA
FileCabi.net 10Mb Removed after 30 days of no use. Video No United Kingdom USA
PutFile 10Mb Removed after 6 months of no use. Video, SWF No United Kingdom USA
ZippyVideos 10Mb Removed after 60 days of no use. Video No Canada Usa
TheFileBucket Unknown Unknown ALL Yes Malaysia USA
Webbit Various Various, depends on which one you use. ALL No N/A N/A
Upload2 25Mb Removed after 1 year. ALL No Ukraine USA
UploadYourFiles Unknown Unknown ALL No Germany Germany
FileFactory 500Mb Removed after 30 days of no use. ALL No United Kingdom United Kingdom
Insightz 500Mb Removed after 30 days of no use. ALL No Canada USA
mysharebox 100Mb Removed after 30 days of no use. ALL No USA USA

Image Storage

Name Size Limit Registration
ImageHeap 10Mb Not Done Yet
ImageRage 5Mb No
WatoTron 5Mb Yes
FileHost 5Mb No
ImageXoom 4Mb No
PhotoServer 4Mb Yes
Xtra-Image 3Mb No
Uploadz 2Mb No
FreeImageLibrary 2Mb Yes
MyPhotoHost 2Mb Yes
TheFreeImageHosting 2Mb No
Uploadz4Free 2Mb Yes
XS 1.5Mb No
PicsPlace 1.5Mb No
WebDevote 1Mb No
SimpLoad 1Mb No
MyImageDepot 1Mb No
MyImageHosting 1Mb Yes
ImgSharing 1Mb No
ImageShack 1Mb No
UploadImages 1Mb No
PicVault 1Mb No
ThePictureBin 1Mb Yes
Iuploads 512Kb Yes
ImageSubmit 450Kb No
Sighost 300Kb Yes
RealImageHost 256Kb Yes
PhotoBucket 250Kb Yes
ImageDropBox 250Kb Yes
TinyPic 250Kb No
FreeFotoFile 200Kb Yes
ImgZone 100Kb No
SolisStyle 5Mb Yes
FotoPic Unknown Yes
GigaShare 2Mb No
PutFile 10Mb No
Desearch 4Mb No
Upload4Free 2Mb No

(See also freemail.htm)

Other stuff   

[Robot generated WWW-Indexes]   [WOLF]

[Amazon browser]   [CUI: Computer Science Library]   [Internet library for librarians]

[RFC search]   [FAQ search]   [FAQ search (Utrecht)]  

[RFC search] (Request For Comments archive: more than 3100 RFCs)

Enter the search string, select the type of search, and click on the Search button to initiate the search.



Search RFC Options

Note: These options only pertain to searching the RFC archive (Search RFCs - ...) and not to the "Search RFC Index". The "Show X references per file" option only applies when the "Search RFCs - Show References" search is selected.




http://www.faqs.org/faqs/: Internet FAQ Archives

Quick FAQ Search





http://www.cs.uu.nl/cgi-bin/faqwais: Utrecht CS News.Answers FAQ Access Methods

Keyword(s)


Searching the 'invisible' web


§ This section is obsolete. For a more recent treatise of these matters, confer deep web searching


The main search engines cover -at best- one third of the web. This is due to the fact that a lot of information on the web is not actually sitting on the web as we know it. Data are stored in formats other than HTML, and as such cannot be indexed by spiders and crawlers. Though many databases can be searched through a internet html-interface, the data themselves sit outside the web, and are linked "on the fly" via PHP, PERL, Javascript, CGI or any other scripted programming language you can fancy.
This is true for the result pages of the search engines themselves, btw: every time you perform a query the search engine send back SERPs to your browser, formatted into HTML on the fly by a scripted program. Those specific SERPs you called into html-life did not exist before and will disappear again the moment you move on: redoutable powers of the nethervoid.

The 'invisible web' is thus made mostly out of huge commercial and non-commercial (scientific, educational, legal) databases.
Moreover there are specific databases that are not fully indexed: repositories of phone numbers; products catalogs; lairs of e-mails, credit card numbers & addresses; collections of dictionaries, vocabularies, thesaurii, electronic books and journals, bulletin boards, mailing lists, and so on.
Some of these 'invisible' databases are password protected. Obtaining access could involve a moderate amount of work.


[Complete planet]
"Discover over 70,000+ searchable databases and specialty search engines"
Topics:
Agriculture Games & Hobbies Military Religion Arts & Design Government Music Science Business Health News Search Engines Computing & Internet Home & Garden Newspapers Shopping Education Humanities People Social Sciences Energy Jobs & Careers Places Sports Engineering Law Politics Transportation Environment Literature Products & Technology Travel Family Living things Recreation Weather Finance & Economics Magazines & Journals References Food & Drink Media & Entertainment Regional


[Info Mine]
"Infomine is a virtual library of Internet resources relevant to faculty, students, and research staff at the university level. It contains useful Internet resources such as databases, electronic journals, electronic books, bulletin boards, mailing lists, online library card catalogs, articles, directories of researchers, and many other types of information. Infomine is librarian built. Librarians from the University of California, Wake Forest University, California State University, the University of Detroit - Mercy, and other universities and colleges have contributed"




[Academic Info]
"Academic Info is an online subject directory of over 25,000 hand picked educational resources for high school and college students as well as a directory of online degree programs"

Search for:    






Local search: valuable pointers


Hitbox (now yep)
Warning!: Advertisement & Akamai webbuggers infested
search model: http://www.yep.com/cgi-bin/search.cgi?criteria=%22how+to+search%22&x=28&y=13



Dejavue
Dejavue does not exist any more: it has been acquired by Google


Usenet repository: a dynamic repository of practical wisdom with access to more than 40,000 online discussion forums

NEW ADVANCED GOOGLE USENET MASK


Erolinx


  More Options
Don't underestimate smut-search engines as learning tools! Go to the Webmasters / chatboard & lounges part of this smut search engine in order to learn how these idiots are trapping lusers to their ads-infernos...

iDexer Adult Search

Don't underestimate smut-search engines as learning tools! ...

Acronim finder

Type in the acronym to search for without periods or quotes (example: ASAP)
Cut and paste following example (and edit):
http://www.acronymfinder.com/af-query.asp?Acronym=ASAP&String=exact&search.x=63&search.y=9


OLGA Search Engine
(courtesy of P.Cook)
If searching for a song or a band, simply enter the name. Eg 'dont think twice' 'pearl jam'.
If searching for an individual artist, (or a band named after an individual artist) enter the last name (and then, if you want, the first name.) Eg 'cohen leonard' 'piaf edith' 'conte paolo'.
  Tabs   Bass Tabs   Chords   Chordpro      Hits Per Page:
Artist/Group/Title:

AOL






The following three russian pointers have been ported here from a closed pagetool of mine: they are quite useful in order to find a lot of interesting stuff (and Bilibin's images). Enjoy!
Rambler's top 100


For instance mp3: http://top100.rambler.ru/top100/MP3/index.shtml.ru

Or free soft and web tools... http://top100.rambler.ru/top100/Software/index.shtml.ru

Ïîèñê (search)            [Advanced search]
Help

Top100  Web at large

Spylog's top


Search spylog 

1000stars' top


    


Webrings searching

How important this can be! A thousand "voluntary operators" at your disposal!
See also
[The importance of Webrings for combing purposes]. You'll use this approach again and again once you'll have understood the limits of the 'main' search engines :-)



World of webrings

http://www.webringworld.org/: World of Webrings
"This web site is the result of the collaborative effort of some experienced ringmasters engaged in the webring community"

http://www.webringworld.org/modules.php?op=modload&name=Search&file=index&mid=15: Webringworld's search function (bad php, does not work properly on good browsers, you may have to use some MSIE-crap for this).


RingSurf Directory

The RingSurf Directory search engine is SLOW, be patient...
Search
in:
for:
   Family Filter Off:
among: Keywords Title Description


Yahoo's webrings

Yahoo has 'fagocitated' various webring centrals, but you can still search Yahoo's webrings!


Crickrock rings


http://www.crickrock.com/ringlist.html 'crickrock' rings:
"Face it, Yahoo! screwed you and your webring when they acquired webring.org. Move to CrickRock an d solve your problems"


Ringlink

http://www.ringlink.org/: the Homepage of Ringlink. Ringlink is ...
  1. The Ringlink Program - A Perl program that provides the tools you need to run one or more webrings, i.e. systems of links between related web sites.
  2. The Ringlink Project - An open source project aiming to further develop the Program, and support its use.
http://ringdir.ringlink.org/: Ringlink Webring Directory ("Webrings are a fun and easy way to find groups of websites with related content"

most popular 50



Bravenet 'site rings'

http://www.bravenet.com/sitering/search.php

   Keyword Search:   Category Search:  




Weblogs searching

Weblogs... even more blah blah blah you could possibly fear to waste your web-life with :-)

Check the ad hoc blogs section.



Recall searching

Recall is, strictly speaking, no real search engine. But it gives you the possibility to check the spreading and the fortunes of a given 'term' on the web.
So it is, after all a local search engine.

April 2006: DOWN :-(

Recall BETA   After before About Recall  Feedback



Homepages searching

Use our [scrolls]!

Woah! Say... have you ever tried some "homepage fishing"? Have a go and lemme know what you think...
Freepage providers are of course pavid commecial bastards, that you should avoid at all cost for real publishing purposes... BUT NOT for searching purposes. Infact these clowns will ruthlessly use everything you post, they will track (cowardly & hiddenly) any visit to your site, and at the same time they will be pleased to nuke your site -just in case- as soon as they realize that you are offering for free something that they would like to sell... yet they are not very clever, Gott sei dank, and their grepping bots are quite simple & silly (or worse). Therefore ~S~eekers can still find -through these very "free" sites' specific search engines- a lotta "hidden" sites with valuable (if scattered) info... yet you better understand the
[evaluation] lore first


homepage searchingengines: hpsearch.unitrier.de


An interesting trick (by sonofsamiam) in order to rank, for instance, all geocities pages (are they really 132150?) through the following querystring:
http://us.geocities.yahoo.com/search?p=a+b+c+d+e+f+g+h+i+j+k+l+m+n+o+p+q+r+s+t+u+v+w+x+y+z+_+0+1+2+3+4+5+6+7+8+9&o=o&h=s
Note the rank order... and note that we loose only relatively few pages (less than 4%!) if we omit the numbers in our searchstring:
http://us.geocities.yahoo.com/search?p=a+b+c+d+e+f+g+h+i+j+k+l+m+n+o+p+q+r+s+t+u+v+w+x+y+z&o=o&h=s
this "number rarity" (only 7% of the pages: 9242) proves once more how PREDICTABLE the stringpatterns are inside webpages...

Xoom

   Other Xoom masks
 Search tips

Tripod (& Angelfire)

Search     

Search Tripod on "how to search":
http://www.tripod.lycos.com/bin/search/search_tripod?query=%22how%20to%20search%22

Search Angelfire (connected to Tripod) on "how to search":
http://www.angelfire.com/cgi-bin/search?first=6&maxhits=20&catalog=Angelfire&query=%22how%20to%20search%22

Fortunecity

   
Fortunecity is a collection of free pages, launched in November 1996, transformed from a UK-based (London) company into a full-fledged international operator with main strongholds in UK, Sweden and Germany. Bought recently France's Citeweb. 3,5 million sites, growing.
Fortunecity Searchstring

http://www2.fortunecity.com/apps/Search/Search?searchterms=%22how+to+search%22&searchtype=fc%3AFC_index

Geocities

http://www.geocities.com/cgi-bin/search/isearch The search engine is an Inktomi-type. Geocities pages are often used as 'repositories' for files you may be searching for (often with faked endings to avoid sysads snooping): mp3 files abound.
Crosswind

Keyword Search for Crosswind

Search for:



Other stuff: Journals


(Note that this part of searchlores overlaps with the ad hoc section journals.htm


Directory of Open Access Journals
http://www.doaj.org/: Directory of Open Access Journals. This service covers free, full text, quality controlled scientific and scholarly journals. We aim to cover all subjects and languages. There are now 2209 journals in the directory. Currently 604 journals are searchable at article level. As of today 95820 articles are included in the DOAJ service.

http://arxiv.org/
http://arxiv.org/: Open access to 365,819 e-prints in Physics, Mathematics, Computer Science and Quantitative Biology





Amazon browser (mod. entry: 05.2006)

Amazon browser "A tool for browsing the mass of literature, music and film contained in the Amazon database, by exploring links between related items"

Touchgraph style tool

Internet library for librarians (mod. entry: 05.2006)

http://www.itcompany.com/inforetriever/: Internet library for librarians,
"A Portal Designed for Librarians to Locate Internet Resources Related to Their Profession", very americanocentric.


CUI: Computer Science Library (mod. entry: 05.2006)

CUI: Computer Science Library: (Centre Universitaire d'Informatique, Uni Genève),
"This database lists all the publications in the CUI's Computer Science library."

http://www.unige.ch/search/welcome.php: WWW.UNIGE.CH, search mask
ressSelectionMaths.html: ressources thématiques en MATHÉMATIQUES
http://atoz.ebsco.com/Home.asp?id=4390: Répertoire des périodiques électroniques



Petit image
Bk:flange of myth
(c) III Millennium: [fravia+], all rights reserved, reversed, reviled, revelled, revolted