 |
| RABBITS & WEBBITS
Magical searching
1st published @ searchlores.org in November 2002
(This version 1.41 was updated in March 2006)
Catching the rabbit's ears & pulling files out of the hat
Advanced Web searching tricks
by VVAA (Various Authors) Intro by fravia+
|
Introduction (legal rabbish) |
Can "explaining people how to search files that are on the web" be constructed as "illegal" by some nasty clown?
Since I respect even the most stupid laws, this would really annoy me.
Let's have a look at the EU-directive on "copyrights" (read
patents, please always remember that
"intellectual property" and "copyrights" are only cheap and rather sarcastic euphemisms for patents): Directive 2001/29/EC of the European Parliament and of the Council of 22 May 2001
on the harmonisation of certain aspects of copyright and related rights in the information society (just change the EN
in this URL
http://europa.eu.int/eur-lex/lex/JOHtml.do?uri=OJ:L:2001:167:SOM:en:HTML to any other of the
EU languages to read it translated in your own language, here the relevant codes: en, fr, de, it, nl, es, el, pt, sv, da, fi,
cs, et, lv, lt, hu, mt, pl, sk, sl).
This appallingly restrictive EU Directive is clearly a
law designed only in order to protect the interests of large media organisations, still they couldn't exagerate so much
that they would have given away their true aims. When "consumers' rights" "threaten the industry future" you cannot
simply state that you want to screw consumers' rights in order to protect your paymasters.
In the introductory explanations (22) we can read: "The objective of proper support for the dissemination of
culture must not be achieved by sacrificing strict protection
of rights or by tolerating illegal forms of distribution
of counterfeited or pirated works". This is interesting, albeit -of course- very wickedly expressed.
So we note that the real objective to achieve is NOT "strict protection
of rights" or "tolerating illegal forms of distribution
of counterfeited or pirated works", the real objective to achieve, as stated in this
text, is "proper support for the dissemination of
culture", albeit in achieving it we shold -admittedly- not make excessive sacrifices.
For the rest, reading the text of the law, we see (art.3) that one should not make "available to the public... [patented] ...works in such a way
that members of the public may access them from a place and
at a time individually chosen by them] and this we do not. We do not make files available, they are already
available. We just point out that they are available everywhere, and,
incidentally, this may even help those very bastards patent-holders and their political lackeys
to track many repositories of patented files... those clowns may infact learn something as well.
Also we note with proud and sense of accomplishment that the objective of "proper support for the dissemination
of culture" is achieved -inter alia- also through our webbits.
I can breath again.
Such is the advantage of knowledge: Fac sapias, et liber eris.
CATCHING WEB-RABBITS (WEBBITS)
|
Da structurz of da rabbitz (de structura et usu lepris)
warez rabbits
pr0n rabbits
Serialz rabbits
Gamez rabbits
Another way of getting them webbits out?
mp3 rabbits
DVD & Media rabbits
Mordred's asterisks
bookish rabbits
newsfeedish rabbits
email gathering rabbits
proxy rabbits
Rabbits for rabbits
Cameras and Cams webbits
Other roadkills
People that are new to our webbits wont believe the techniques described
here until they try them... so just try them :-)
Those that already know how powerful and mighty Seekers
can be, will not wonder at all.
Both kind of readers will
find these tricks more useful than my words can possibly underline.
Hey! This section is mostly for advanced seekers, there's shouldn't be any need to explain too much... either you
do understand everything, and then some,
just looking at the examples listed below, or you better
come back here later... or never again.
Yet I'll still begin with a short explanation: homines, dum docent, discunt :-)
Da structurz of da rabbitz (de structura et usu lepris) |
I'll use as a first example a mp3-searching webbit. I repeat: a mp3-searching, not a mp3-stealing, webbit.
Seekers don't care much for the various copyright and patent crap
obsessions of the powers that be and of their political lackeys and servants
because seekers -in fact- NEVER STEAL NOTHING. They don't need to: they find, and
in so doing just point out how easy it is to share and bypass any silly commercial restriction
on a web MADE in order to share knowledge: our web. The web was made for sharing, not for hoarding and not for selling.
Hence la va sans dire
that real seekers do not need to possess copyrighted music (nor anything else), they just
taste the fruits without
ruining the plants.
We are forced then to conclude that nature in constituting the seeker made it near and dear
to itself; for so it comes to repel from him all that is injurious and give him free access to all that is serviceable or akin to it.
As ole Diogenes pointed out long ago: possessions do not make you vulnerable unless you become attached to them :-)
To live in accordance with the
web is the Seekers' conception of the good for man.
The seeker does
not need to hoard on his hard disks whatever he has found on the web: all the various images, musics, films, books and
whatsnot that he fetches on the web... he can just taste, use, evaluate and leave there what he finds, without any need to copy it or
to "possess" it,
because he knows that nothing can disappear any more:
once anything lands on the web, it will always remain there, copycatted in a zillion
places and available for the eternity to all those that know how to search :-)
But in order to search effectively, in order to quickly find his targets wading among the commercial morasses
and the stinking spam lagoons, he needs a quiver of sharp arrows, and his webbits, as you will now see, are
among his most useful tools.
Enuff, let's begin.
The structure of the following MIGHTY mp3-webbit, that I recently presented in Berlin
has some interesting characteristics, that may be used to exemplify general webbits' structures and purposes.
| |
1
| 2
| 3
| 4
| 5
| 6
|
| High precision |
aqua
| barbie
| mp3 OR ma4 OR ogg
| intitle:"Index of"
| -metallica
| +"4.6M"
|
| High recall |
lavigne
|
| mp3 OR ma4 OR ogg
| intitle:Index.of
| -beatles
| +"4.4M"
|
| |
group
| title
| format variants
| index of in title
| spamkiller
| variable parameter (guarantees length)
|
-
The group or singer name is mandatory.
-
Adding your target's title can help if you encounter too much noise.
-
The format variants will guarantee a broader spectrum. If the search engine you are using is heavily censored (as
it more and more happens) just eliminate the mp3 parameter. Chances are that in the mp3 lists there will be some
uncensored
ma4 or some ogg file that will allow their retrieval nevertheless.
-
The intitle="index of" (or intitle:index.of, which is the same and avoid two keypresses :-) is mandatory, and
still allow fairly decent results, the spammers notwithstanding. Of course the intitle: operator is to be used with
google, check the different operators for the oher search engines, or just use the simple "index of" string snippet.
-
The -metallica (or -beatles, or whatsnot) serves as a spamkiller, because many clowns still try
to fish zombies out of the real web uploading huge lists of groups' names. If you'r going for high recall, then re-launch your query with a different
group acting as spamkiller.
-
Finally we come to the LENGTH parameter, which not only guarantees the
presence of at least some Megabyte heavy mp3, thus cutting away all the irrelervant noise of those
bogus "index of" spammer sites with "small snippets" of music, but also can be
varied ad libitum and will thus guarantee you hours of fishing pleasure. You may try for instance all variations
in the range 1.6M - 6.4M. I suggest starting in the band 3.5-4.5 (which is a good signal ratio for mp3s) and
moving upwards if you'r an optimist and downwards if you are a pessimist.
And now the gentle reader is invited to understand on his own the reason d'être of the following webbits...
Old & recent "warez" rabbits/webbits ('catching databases from below') |
http://www.google.com/search?hl=en&ie=UTF-8&q=%22parent+directory+%22+%2Fappz%2F+-xxx+-html+-htm+-php+-shtml+-opendivx+-md5+-md5sums
Old & recent "gamez" rabbits/webbits
|
http://www.google.com/search?hl=en&lr=&ie=UTF-8&q=%22parent+directory+%22+Gamez+-xxx+-html+-htm+-php+-shtml+-opendivx+-md5+-md5sums
"parent directory " inurl:games -xxx -html -htm -php -shtml -opendivx -md5 -md5sums
Old & recent "prOn" rabbits/webbits
|
(see also the more
"in-depth" pron searching at the
classrooms as well)
I have received some legitimate critics, since the 'porn search' was eminently 'masculine' oriented.
Hence now female readers may also
enjoy some of (or maybe all) the 'searcharrows' listed below...
Searching entries 'around the web', no specific target, using 'common' passwords:
For instance: bob:bob
Searching entries to a specific site (not necessarily pr0n, duh):
For instance: "http://*:*@www" supermodeltits
And now some 'choose the right bait' arrows:
The "moaning & whimpering & whatsonst" trick:
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&as_qdr=all&q=%22she+made+*+noises%22&btnG=Google+Search
...of course you may go for a specific target word as well:
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&as_qdr=all&q=%22she+made+muffled+noises%22&btnG=Google+Search
The more fantasy/linguistic knowledge you
apply,
the better (potentially) your results.
The "between her legs" trick:
http://www.alltheweb.com/search?avkw=fogg&advanced=1&cat=web&jsact=&_stype=norm&type=all&q=%22between+her+legs%22
Our female (or female inclined) readers will probably appreciate the following query, instead:
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&as_qdr=all&q=tender+%22caressed+her%22+-brutal&btnG=Google+Search
Here an even more "female-softporn" oriented querystring:
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&as_qdr=all&q=tender+%22caressed+her%22+-brutal+%2Bromantic&btnG=Google+Search
You may also try some "japanese" pr0n search strings:
shibari
koonago
bukkake
Another, more recent, 'funny' pron webbit (use your own fantasy/phantasms for more, err, interesting results, duh):
"blue bikini" "Apache" index.of
Uhmmm. Is this a "mature" audience?
intitle:index.of +playmates -hot -"free pics" -filetype:htm -filetype:html
Or, if you want something more vulgar,
(blnd3 OR blonde3 OR blond3) intitle:index.of
...the only limit to such arrows is, as always, your own creativity :-)
Besides the difference between pr0n and art is always in the eye of
the beholder: Gustave Courbet The Origin of the World.
Another, msnsearch related "index of" webbit:
Nov-2005 intitle:"Index of /" {frsh=9999} , for isntance Nov-2005 intitle:"Index of /" {frsh=9999} "digital photography"
Email gathering rabbits/webbits
Very dangerous openings for malpeople & spammers & phisers:
For instance:
http://www.google.com/search?hl=en&as_qdr=all&q=%22vbulletin.sql%22+%22DROP+TABLE+IF+EXISTS%22&spell=1
or, even better,
http://www.google.com/search?as_q=vbulletin.sql&num=10&hl=en&btnG=Google+Search&as_ft=i&as_qdr=all&as_occt=url&as_dt=i
or, even better,
http://www.google.com/search?hl=en&lr=&as_qdr=all&q=%22DROP+TABLE+IF+EXISTS%22&btnG=Search
And then you get email adresses, real addresses and whatsnot. Here an example found in December 2004, preserved coz things disappear
on the web.
Proxy rabbits/webbits
A funny one: "Yeahh, this proxy is working fine!"
Another one: ""*.*.*.*.*:80" "*.*.*.*.*:8080" "*.*.*.*.*:3128""
http://www.google.com/search?hl=en&lr=&ie=UTF-8&q=%22%2A.%2A.%2A.%2A.%2A%3A80%22+%22%2A.%2A.%2A.%2A.%2A%3A8080%22+%22%2A.%2A.%2A.%2A.%2A%3A3128%22+&btnG=Search
Rabbits to find other rabbits
http://www.google.com/search?num=30&hl=en&lr=&safe=off&q=%22visa+4356000000000000..4356999999999999%22&btnG=Search
Sort of "Metawebbit', by il-li :-)
DVD & Media rabbits/webbits
http://www.google.com/search?hl=en&lr=&ie=UTF-8&q=%22parent+directory+%22+DVDRip+-xxx+-html+-htm+-php+-shtml+-opendivx+-md5+-md5sums
http://www.google.com/search?hl=en&lr=&ie=UTF-8&q=%22parent+directory+%22Xvid+-xxx+-html+-htm+-php+-shtml+-opendivx+-md5+-md5sums
Serialz rabbits (the cereal killer)
(fisher fritz fished frish fish when fishing fish with fish)
Small FAQ
Q: what is the main problem on the web? A: avoiding commercial idiots.
Q: when can you be sure that
you are not dealing with commercial idiots?
A: When you actually find a site that delivers something, instead of only asking for money/karma/patience/bad taste.
Q: In the case of serials, how do you avoid commercial idiots? A: Using a working serial as a bait, if possible
related with your serial search.
Yep, the moral of the story is: if SOME results are there, then the site cannot be
that bad... and here we have the famous xcx8-19x0, of C++ Builder V4 fame... of course it would also work with
100-000-0427, for the same reasons. Try SFJHTKJO (cache) 60D015F4-600893EB-BEFF5755-53E4E943 (CD rom),
NOV30195MAY261951 (MAC) etcetera...
Small hints
1: Always use quotes, duh. 2: Use other serials,
not the ones above, duh... as vvf pointed out, if you use these "stale" cereals you will just get a lot of "historical" pages. 3: Buy your really useful appz.
Da cereal killer:
http://www.google.com/search?as_q=%22xcx8-19x0%22&num=10&hl=en&btnG=Google+Search&as_epq=&as_qdr=all&as_occt=any&as_dt=i
Another one:
http://www.google.com/search?hl=en&lr=&c2coff=1&q=Waver+1.52+Waver+2.04+Waver+2.05+Waver+2.70++Waver+2.80+Waver+2.81++Waver+2.85&btnG=Search
Serialz knowledge
Q: Why do we have so many software versions, serials, numbers, apparent evolution? I mean, why do we have version 2.0.16 and, two weeks later, version
2.0.17 of some crap app?
A: Coz as soon as slow clumsy protectors devise a protection,
nimble beautiful crackers find a clever way to defeat it. And seekers find both: the appz and its nemesis.
Q: But most serialz/cracks refer to older versions of the appz. And these are no more on the web, at the moment
you can download only the last uncracked version!
A: Nope. ALL versions of ANY app are on the web, the most recent ones (not yet in the shops) and the older
ones (even the programmers forgot them). If you learn how to search you will find all of them.
Besides, as ole +ORC pointed out many years
ago, PC-MAGAZINES have published year after year all the older versions...
and you can buy 20 kilos old PC magazines with their CD-roms for a couple
of euro. A treasure of already serialized/cracked working older appz and some great cracking 'textbooks'
at the same time :-)
Warez in google groups
(special thank to Spiro Conomos for this)
in Google Groups, advanced, "past 24 hours" use the query:
+("ftp://" OR "http://") +("*.zip" OR "*.exe" OR "*.rar") -link group:*crack* OR group:*fido7* OR group:*2600*
http://groups.google.com/groups?as_q=%2b%28%22ftp%3A//%22%20OR%20%22http%3A//%22%29%20%2b%28%22*.zip%22%20OR%20%22*.exe%22%20OR%20%22*.rar%22%29%20-link%20group%3A*crack*%20OR%20group%3A*fido7*%20OR%20group%3A*2600*%20
...do not be TOO afraid of commercial idiots... (- ritz)
sometimes you just have to be bold and plainly ask for what you want.
bye!
Basta pasta! (enough google!) (Nemo)
*If* commercial idiots don't have a link to a file with the correct extension, how do they expect me to believe they know *where* is that damn file.. doesn't sound credible.. he
Solution: build your queries in a way that also probe *if* their claims are consubstantiated.
Example:
Another way of getting them webbits out? (1)
(catching non linked shadowy sites)
forums having warz'd stuff have rules of not putting active links, so:
http://www.google.com/search?q=%22no+active+links%22+games
Needs to be refined, though.
Another way of getting them webbits out? (2)
(warez' hybris)
Based on the fact that most of the warez groups like to trumpet the fact that they're actually making something available for download ;)
http://www.google.com/search?q=%22tagged+and+upped+by%22
Another way of getting them webbits out? (3)
(anchor trick)
One of Altavista's most SPECIFIC features is the anchor: operator, which will allow
patient searchers to find relevant pages trough tha anchor tag.
For instance: anchor:snowflakes or
anchor:posette or
anchor:beria or
anchor:kafka will give you a
series of noise
reduction arrows...
of course you can extend the trick to whatever...
anchor:warez or
anchor:gamez or
anchor:whatever :-)
New mp3 rabbits (catching "streaming" mp3 at the source)
First of all, a little history: this was a
classic "old" mp3 rabbit-lure:
+"index +of/mp3" +dylan
Ok, it works, not bad either... but that's NOTHING compared to the following magical query:
Clever "Andromedian" rabbits (prototype, by TBD): "search
andromeda" "play all"
Refining the mp3 prototype...
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&safe=off&q=%22doors%22+%22powered+by+Andromeda+version%22&btnG=Google+Search
"doors" "powered by Andromeda version"
"vivaldi" "powered by Andromeda version"
http://www.google.com/search?q=%22search+andromeda%22+tolkien&num=25&ie=utf-8&oe=utf-8
You dig it? And this can be applied, mutatis mutandis to any other authomated retrieval/distribution
system. See the discussion at the [the ~S~ Seekers' msgboard]
And now a december 2002 addition to the "Andromeda" trick, by ronin:
mp3 searching - generalisation from andromeda's trick (14/12/02 04:28:57)
To fish some equivalent solution :+"Dynamic MP3 Lister" +andromeda
http://www.hotscripts.com/PHP/Scripts_and_Programs/Music_Libraries/
http://php.resourceindex.com/Complete_Scripts/Audio_Management/
~
- Andromeda
SE Anchor : "powered by andromeda version"
- Dynamic Mp3 Lister
SE Anchor : "Dynamic MP3 Lister"
even better: "Dynamic MP3 Lister - Listing MP3s in"
- Mp3 Links
SE Anchor : "Catalog powered by: MP3Links"
- Ampache
SE Anchor : "Welcome to Ampache v."
- kplaylist
SE Anchor : "Notice! This is a non public website. All actions are logged."
Password gate
- GNUMp3d
SE Anchor : intitle:"GNUMP3d" subdirectories
ronin
All the damned porn sites now have incorporated
the "index +of" "parent directory" string right into their keyword Traps now - as i guess you've seen
Yes, MP3-Searching in the web ain't no real fun anymore ;)
you can bypass SOME of the fake Traps by using just a few more keywords to get to more legitimate indexs
try
"index +of" "last Modified" "size" "description" descriptive keyword (mp3?)
by Jeff, February 2004
You may try as well:
http://www.google.com/search?hl=en&lr=&ie=UTF-8&q=%22parent+directory+%22+MP3+-xxx+-html+-htm+-php+-shtml+-opendivx+-md5+-md5sums
Another MP3 rabbit (by il-li)... "Apache/1.3.29 Server at " mp3:
http://www.google.com/search?num=30&meta=hl%3D%26lr%3D&q=%22Apache%2F1.3.29+Server+at+%22+mp3
...modify version number as needed :)
And a classical old one: 'parent directory' +mp3 +dylan, now almost useless...
http://www.google.com/search?as_q=+%27parent+directory%27+%2Bmp3+%2Bdylan&num=100&hl=en&btnG=Google+Search
...unless you repeat it with a "non-google" search engine :-)
'parent directory' +mp3 +dylan @ yahoo (third page)
A simple music searching approach (using as before m4as, that are less censored than mp3s):
(ma4 OR mp3) "index of" +garfunkel
For instance:
http://violentnation.com/fipilele/
And you'll also land inside this huge mp3 pasture, so big that it may crash evena mighty browser (try it with firefox and it will
probably grind)...
http://24.91.184.80/jserver/files/music/
Also:
intitle:"index of/" "last modified" "mp3" lady madonna,
And at Baidu: mp3 "Port 80"
(for instance: mp3 "Port 80" eminem).
A more complex mp3 "klebing" webbit:
allintext: ma4 OR mp3 OR ogg "icons/sound2 gif"
See? Now you'll have to "peel the URL-onions", backwards, towards the correct targets.
Whenever you need some specific music, once again, just fetch it from the web at once.
Using a simple string
intitle:"index of/" "last modified" "mp3" garfunkel
or a "local" one:
intitle:garfunkel site:geocities.com
or a, ahem, "creative" one:
intitle:fuckriaa garfunkel, q.e.d. ~
Hey, who doesn't enjoy downloading some crispy, fresh mp3s at home, tired and aching after a
long day's work? Here a magic, nice querystring, just smash it inside google:
dylan "snd *.mp3 *-*-2005 *:* *.*m" OR "snd *.mp3 *-*-2005 *:* *.*k" OR "snd *.mp3 *-*-2005 *:* *.*"
and obtain the following link: http://www.google.com/search?hl=en&lr=&ie=ISO-8859-1&q=dylan++%22snd+*.mp3+*-*-2005+*%3A*+*.*m%22+OR+%22snd+*.mp3+*-*-2005+*%3A*+*.*k%22+OR+%22snd+*.mp3+*-*-2005+*%3A*+*.*%22++&btnG=Search
Change dylan to garfunkel, mendelssohn, mozart or whatnots. Change 2005 to 2004 (or earlier) for a different (but possibly more stale) search.
As usual, phantasy is the secret weapon of the seeker
More "index of" tricks
("Mordred's asterisks")
A trick to make google highlight important parts of the summary with careful usage of asteriscs. For example if we're looking for a big (as possible) recording of rain, we'd use the "index of" trick like that:
"index+of/" "rain.wav"
60 results - but... which one to choose?
Here is a
better way to gather relevant info:
"index+of/" "rain.wav******"
Index of /rmx/impregnation
Index of /rmx/impregnation. ... cymb.wav 14-Mar-2003 08:14 975k hh.wav 14-Mar-2003 08:14
780k kick.wav 14-Mar-2003 08:14 780k rain.wav 14-Mar-2003 08:15 4.2M sample1 ...
chronofixion.free.fr/rmx/impregnation/ - 4k - 26 Mar 2003 - Cached - Similar pages
Mmmm... the six asterisks put the info we need in bold. See the size? 4.2M :)
Books - rabbits (Jeff's
quick "book search" tip)
Check the ad hoc how to find books section
(See also the classrooms
books & books & dark riders
and
Cat burglers in the museum after dark)
Some useful 'rapidshare-alike' webbits (see: here)
Here some simple "repositories" webbits:
For pdf files: pdf "rapidshare.de/files" site:rapidshare.de
For movies/video files: +inurl:avi|mpg|wmv site:rapidshare.de
For mp3 files: +inurl:wma|mp3 site:rapidshare.de
For software/zipped files: +inurl:exe|rar|zip site:rapidshare.de
Replace rapidshare.de with megaupload or anyone of the repositories that you will find here in order to search for files available on different servers.
The following comes from a thread first discussed on the
[~S~ Seekers' msgboard]
~S~ Jeff has given to the seekers community so much that
it would be hard to find something great enough to tank him. Here, if you take the time to understand the following,
you will find a short 'compendium' of the "how to find a book on the web" lore.
Re: book search strategies?
"I am making this post after a search for:
Genius The Natural History of Creativity
written by
Dr. Hans Eysenck
Very poor results, but I did not try everything yet.
Suggestions for this or any book search?"
sometimes "too much" is "too little"...
i begin with your words ...
Dr. Hans Eysenck 1910 returns
ok the guy is there ... what do I want? something written BY him
"by Dr. Hans Eysenck" otoh only 3 returns
regroup-rethink ... too much is too little
by Hans Eysenck 6,670 returns --- ok back on track now
filter alittle more ... Genius 512 ... and I see your full book title ... I could take a different path here now or keep on with this one ... I decide to take the Y in the road
is it online??? lets ask
"full text" "Genius The Natural History of Creativity" returns 4
i only looked at the first return ... seems to indicate a full text ... asks for your proquest login ... ah so now we know how we can get a full text Welcome
many ways to skin a cat ... problem is catchin it
jeff
Except: Jeff's search did not work out:
Just because Google says it IS doesn't mean it's SO. We must remember to check our results!
(This happens to Jeff sometimes... that's okay, we all make mistakes)
But it does provide good tactics.
Also - it has already been posted here:
fu_speci.htm, and should be cross-linked...
(and it has been duly indexed by Google...)
- Stiletto
Another nice webbit:
-inurl:htm -inurl:html intitle:”index of” +(“/ebooks”|”/book”) +(chm|pdf|zip) +”For Dummies”
Try out (and understand) this arrow:
-inurl:htm
-inurl:html intitle:”index of” +(“/ebooks”|”/book”) +(chm|pdf|zip) +”For Dummies”
or maybe you prefer this one?
-inurl:htm -inurl:html intitle:"index of" +("/ebooks"|"/book") +(chm|pdf|zip) +"o'reilly"
Newsfeeds- rabbits (Nemo's
"translation" tip)
Those annoying news with signing in options
Can be easily bypassed... as Nemo points out:
Just thought I'd share this with you... While searching for some
info on google I stumbled across a NY Times article. But when I opened
the article all I got was the sign in box. darn. So I thought, try
google cache, but they were clever enough to add a nocache directive.
No luck there.
But the google snippet had some real text from the article, so I know
they are serving google something different than me. trying to make my
mouth water by getting the pages included in the google index but snatching
away the cake before I can eat it!
What the boys didn't think of though, was the google translation tool.
Simply translate the page from X to English and voilla you get served
the cloaked content! I guess NY Times are going to stuff that hole one day, but
in the mean time it makes research so much easier!
That's a neat trick! Probably it should also work for other (news) cloakers... hehehe
This trick is of course -simply put- genial. The idiots that want your data --or your money-- to access their articles
MUST INDICIZE those very "hidden" articles on the main search engines in the first time, else no lemming will
come and smear his data and/or money around. So the complete text must be SEEN by the search engines' bots.
Hence they MUST ALLOW access to the search engines' bots. The more search engines, the more bots... the better.
Google, Fast, Yahoo,
Teoma, you name them.
Hence if you -say- spoof your referral, and/or use a translation service and/or use another of the
many webbits tricks, you'r as good a reader as google
:-)
Try a New york times article
on the form below (but if you use, as you always should, Opera+proxomitron it may not
work... use --for this-- M$IE like the average zombies and you'll do the magic :-)
Cameras and Cams webbits
by Carsten, March 2005
inurl:"ViewerFrame?Mode="
intitle:"Live View / - AXIS" | inurl:view/view.shtml
inurl:"MultiCameraFrame?Mode="
intitle:"WJ-NT104 Main"
intitle:"snc-rz30 home" Requiere plugin Active-X.
inurl:"ViewerFrame?Mode=Motion" Requiere plugin Active-X.
inurl:LvAppl intitle:liveapplet
intitle:"Live View / - AXIS" Requiere plugin Active-X.
"Powered by webcamXP"
inurl:indexFrame.shtml "Axis Video Server"
"MOBOTIX M1" and "open menu"
intitle:flexwatch intext:"Copyright by Seyeon TECH Co"
Axis
axis video server view.shtml
/view/view.shtml axis
"adding live video to one of your own pages a very easy task with an AXIS 2100 Network Camera"
"Live view - / - AXIS"
"Your browser has JavaScript turned off.For the user interface to work effectively"
indexFrame.html axis
"Live web imaging unleashed"
Canon
sample/LvAppl/
MOBOTIX
/control/userimage.html
JVC
"(c)copyright 1999-2003 VICTOR COMPANY OF JAPAN, LIMITED. All rights reserved"
"V.Networks Motion Picture(Java)"
"Control the Pan/Tilt and move to the Preset Position"
FlexWatch
/app/idxas.html
"Saving & Retrieving Mode"
Panasonic
/ViewerFrame?Mode=Motion
TOSHIBA
"TOSHIBA Network Camera - User Login"
Sony
/home/homeJ.html
webcamXP (software)
"my webcamXP server!"
Other roadkill rabbits
[Daily Statistics] [Hourly Statistics] [URL's]
il-li
http://www.google.com/search?q=%22generated+by+webalizer+version%22&num=25&ie=utf-8&oe=utf-8
as diferrent versions have different categories:
[Daily Statistics] [Hourly Statistics] [URLs] [Entry] [Exit] [Sites] [Referrers] [Search] [Agents] [Countries] for example
This gives 75000 results, but many of them are monthly or other stats, so it's maybe better to provide a filter (~50000):
http://www.google.com/search?num=25&hl=en&lr=&ie=utf-8&oe=utf- 8&q=%22generated+by+webalizer+version%22+%22%5BDaily+Statistics%5D%22&btnG=Google+Search
A gameboy webbit:
http://www.google.com/search?q=1929+-+Rave+Master+Special+Attack+Force+(U)&sourceid=opera&num=0&ie=utf-8&oe=utf-8
Another one: http://www.google.com/search?q=1908+-+A+Sound+of+Thunder+(U)&sourceid=opera&num=0&ie=utf-8&oe=utf-8
Mordred
"not for distribution" confidential
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&q=%22not+for+distribution%22+confidential&btnG=Search
intitle:"index of" site:de password
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&q=intitle%3A%22index+of%22+site%3Ade+password&btnG=Search
of course it could be any other country but "de"
"Select a database to view" -google
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&q=%22Select+a+database+to+view%22+-google&btnG=Search
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&as_qdr=all&q=%22Enjoy+this+fine+release+from%22&btnG=Search
and so on... "eitelkeit-fishing"
Franz BeckenFisher
Foreign magazines' passwords
http://www.google.de/search?hl=en&as_qdr=all&q=url%3A*uni*+zeitschriften+passwort+lokal
Skybound
less noise with : +cracker +packager +protection +"release date"
http://www.google.com/search?as_q=+%2Bcracker+%2Bpackager+%2Bprotection+%2B%22release+date%22&num=10&hl=en&ie=UTF-8&oe=UTF-8&btnG=Google+Search
Not all groups use the "enjoy this fine release" sentence. And also, not only scene groups use it.
+filetype:nfo still works fine ;) (around 1K nfos indexed by google.)
Anonymous corrector
Some comments :
+"Select a database to view" +intitle:"filemaker" -google :)
http://www.google.com/search?as_q=%2B%22Select+a+database+to+view%22+%2Bintitle%3A%22filemaker%22+-google&num=10&hl=en&ie=UTF-8&oe=UTF-8&btnG=Google+Search
About the 'confidential' webbit, see : http://www.searchlores.org/loki_mil.htm
Ah, some others (there is no end..) : +password +config.inc +filetype:inc -cvs
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&q=%2Bpassword+%2Bconfig.inc+%2Bfiletype%3Ainc+-cvs&btnG=Search
or +mysql_connect +filetype:inc -cvs
http://www.google.com/search?hl=en&lr=&ie=UTF-8&oe=UTF-8&q=%2Bmysql_connect+%2Bfiletype%3Ainc+-cvs+&btnG=Search
fisher
Funny rabbits
by Kane (September 2004)
intitle:liveapplet inurl:LvAppl
inurl:indexFrame.shtml Axis
intext:"MOBOTIX M1" intext:"Open Menu"
inurl:"ViewerFrame?Mode="
Linguistic rabbits
by fravia+ (September 2004)
intitle:glossary
and of course you may specify your target sector:
intitle:glossary "internet terms"
This is connected both with our Seekers' Linguistic Station and with our
eurosearch.htm regional approach...
Search engines rabbits
by fravia+ (October 2004)
Everything about google's datacenters and dances:
http://66.102.7.104/search?hl=en&lr=&ie=UTF-8&q=%2264.233.161.99%22+%2266.102.9.104%22+%2266.102.7.104%22+%22216.239.41.104%22&btnG=Search
Winall webbits
(Caleb & others, updated September 2005)
[parent.directory winall -xxx -html -htm -php -shtml -opendivx -md5 -md5su]
[winall.retail]
(c) III Millennium by [fravia+],
all rights reserved and reversed