Ok

By continuing your visit to this site, you accept the use of cookies. These ensure the smooth running of our services. Learn more.

« 美媒体猛批布什中国之行称其表现令人失望 | 首页 | 中国可能已有至少300人死于禽流感 »

星期三, 十一月 23, 2005

Google Search Tips 2005/古狗搜索技巧2005版

Google Search Tips 2005


Here are some search syntax basics and advanced tricks for Google.com. You might know most of these, but if you spot a new one, it may come in handy in future searches.

  • A quote/ phrase search can be written with both quotations ["like this"] as well as a minus in-between words, [like-this].
  • Google didn’t always understand certain special characters like [#], but now they do; a search for [C#], for example, yields meaningful results (a few years ago, it didn’t). This doesn’t mean you can use just any character; e.g. entering [t.] and [t-] and [t^] will always return the same results.
  • Google allows 32 words within the search query (some years ago, only up to 10 were used, and Google ignored subsequent words). You rarely will need so many words in a single query – [just thinking of such a long query is a hard thing to do, as this query with twenty words shows] – however, it can come in handy for advanced searching... especially as a developer using the Google API.
  • You can find synonyms of words. E.g. when you search for [house] but you want to find “home” too, search for [~house]. To get to know which synonyms the Google database stores for individual words, simply use the minus operator to exclude synonym after synonym (they will always show as bold in the SERPs, the search engine result pages). Like this: [~house -house -home -housing -floor].
  • To see a really large page-count (possibly, the Google index size, though one can only speculate about that), search for [* *].
  • Google has a lesser known “numrange” operator which can be helpful. Using e.g. [2000..2005] (that’s two dots inbetween two numbers) will find 2000, 2001, 2002 and so on until 2005.
  • Google’s define-operator allows you to look up word definitions. For example, [define:css] yields “Short for Cascading Style Sheets” and many more explanations. You can trigger a somewhat “softer” version of the define-operator by entering “what is something”, e.g. [what is css].
  • Google has some exciting back-end AI to allow you to find just the facts upong entering simple questions or phrases like [when was Einstein born?] or [einstein birthday] (the answer to both of these queries is “Albert Einstein – Date of Birth: 14 March 1879”). This feature was introduced April this year and is called Google Q&A. (See some of the various working Q&A sample queries to get a feeling for what’s possible.)
  • Google allows you to find backlinks by using the link-operator, e.g. [link:blog.outer-court.com] for this blog. The new Google Blog Search supports this operator as well. In fact, when Google’s predecessor started out as Larry Page’s “BackRub” in the 1990s, finding backlinks was its only aim! However, not all backlinks are shown in Google today, at least not in web search. (It’s argued that Google does this on purpose to prevent reverse-engineering of its PageRank algorithm.)
  • Often when you enter a question mark at the end of the query, like when you type [why?], Google will advertise its pay-for-answer service Google Answers.
  • There a “sport” called Google Hacking. Basically, curious people try to find unsecure sites by entering specific, revealing phrases. A special web site called the Google Hacking Database is dedicated to listing these special queries.
  • Google searches for all of your words, whether or not you write a “+” before them (I often see people write queries [+like +this], but it’s not necessary). Unless, of course, you use Google’s or-operator. It’s an upper-case [OR] (lower-case won’t work and is simply searching for occurrences of the word “or”), and you can also use parentheses and the “|” character. [Hamlet (pizza | coke)] will find pages containing the word (or being linked to with the word) “Hamlet” and additionally containing at least one of the two other words, “pizza” or “coke”.
  • Not all Google services support the same syntax. Some services don’t allow everything Google web search allows you to enter (or at least, it won’t have any effect), and sometimes, you can even enter more than in web search (e.g. [insubject:test] in Google Groups). The easiest thing to find out about these operators is to simply use the advanced search and then check what ends up being written in the input box.
  • Sometimes, Google seems to understand “natural language” queries and shows you so-called “onebox” results. This happens for example when you enter [goog], [weather new york, ny], [new york ny] or [war of the worlds] (for this one, movie times, move rating and other information will show).
  • Not all Googles are the same! Depending on your location, Google will forward you to a different country-specific version of Google with potentially different results to the same query. A search for [site:stormfront.org] from the US will yield hundreds of thousands of results, whereas the same search from Germany (at least if you don’t change the default redirect to Google.de) returns... zilch. Yes, Google does at times agree to country-specific censorship, like in Germany, France (Google web search), or China (Google News).
  • Sometimes, Google warns you about its results, especially when they might seem like promoting hate sites (of course, only someone misunderstanding how Google works could think it’s them promoting hate sites). Enter [jew], and you will see a Google-sponsored link titled “Offensive Search Results” leading to this explanation.
  • For some search queries, Google uses its own ads to offer jobs. Try entering [work at Google] and take a look at the right-hand advertisement titled e.g. “Work at Google Europe” (it turns out, at the moment, Google Switzerland is hiring).
  • For some of the more popular “Googlebombed” results, like when you enter [failure] and the first hit is the biography of George W. Bush, Google displays explanatory ads titled “Why these results?”.
  • While Google doesn’t do real Natural Language Processing yet, this is the ultimate goal for them and other search engines. A little What-If Video [WMV] illustrates how this could be useful in the future.
  • Some say that whoever turns up first for the search query [president of the internet] is, well, the President of the internet. (I’m applying as well, and you can feel free to support me with this logo.)
  • Google doesn’t have “stop words” anymore. Stop words traditionally are words like [the], [or] and similar which search engines tended to ignore. Sometimes, when you enter e.g. [to be or not to be], Google even decides to show some phrase search results in the middle of the page (separated by a line and information that these are phrase search results).
  • There once was an easter-egg in the Google Calculator that made Google show “42” when you entered [The Answer to Life, the Universe, and Everything]. If I’m not mistaken, this feature has disappeared and now displays a more reasonable (but less funny) definition of the concept of Douglas Adams’ galactical joke. As I’ve been alerted in the forum, the easter egg only works lower-case.
  • You can use the wildcard operator in phrases. This is helpful for finding song texts – let’s say you forgot a word or two, but you remember the gist, as in ["love you twice as much * oh love * *"] – and similar tasks.
  • You can use the wildcard character without searching for anything specific at all, as in this phrase search: ["* * * * * * *"].
  • Even though www.googl.com is nothing but a “typosquatter” (someone reserving a domain name containing a popular misspelling) and search queries return very different results than Google, the site is still getting paid by Google – because it uses Google AdSense.
  • If you feel like restricting your search to university servers, you can write e.g. [c-tutorial site:.edu] to only search on the “edu” domain (you can also use Google Scholar). This works for country-domains like “de” or “it” as well.


Google Search Tips 2005 by Philipp Lenssen | Comments (25)

http://blog.outer-court.com/archive/2005-09-29-n85.html

-

Submitted by xyz黑板 on 星期五, 九月 30, 2005 - 22:20 资源 | 开发

Google搜索技巧2005版

译自Philipp Lenssen的文章"Google Search Tips 2005"
原(英)文链接:http://blog.outer-court.com/archive/2005-09-29-n85.html
翻译者:xyz黑板 http://xyzboard.com 或 aubucu@hotmail.com
注意:文中[]符号是为了突出关键词,在实际搜索中是不包含的;本文采用的是意译;本译文已经征得作者许可;本译文可任意转载,请保留本文的头信息

1. 双引号可以用减号代替,比如搜索["like this"]与搜索[like-this]是一个效果
2. Google不会处理一些特殊的字符,比如[#](几年前还不行,现在可以了,比如搜索[c#]已经可以搜到相应的结果),但是还有一些字符它不认识,比如搜索[t.]、[t-]与[t^]的结果是一样的
3. Google充许一次搜索最多32个关键词
4. 在单词前加~符号可以搜索同义词,比如你想搜索[house],同时也想找[home],你就可以搜索[~house]
5. 如果想得到Google索引页面的总数,可以搜索[* *]
6. Google可以指定数字范围搜索。搜索[2001..2005]相当于搜索含有2001、2002直到2005的任意一个数的网页
7. 搜索[define:css]相当于搜索css的定义,这招对想学习知识的人很有效;也可以用[what is css]搜索;对中文来说,也可以用[什么是css]之类的
8. Google有一定的人工智能,可以识别一些简单的短语如[when was Einstein born?]或[einstein birthday]
9. 通 过[link:]语法,可以寻找含有某个链接的网页,比如[link:blog.outer-court.com]将找到包括指向 blog.outer-court.com超级链接的网页(最新的Google Blog Search也支持这个语法),但是Google并不会给出所有的包含此链接的网页,因为它要保证pagerank算法不被反向工程(呵呵,可以参见那两 个Google创始人关于pagerank的论文,可下载)
10. 如果在搜索的关键词的最后输入[why?],就会在结果中出现链接到Google Answers的链接http://answers.google.com ,在里面可以进行有偿提问
11. 现 在出现了一种兴趣活动,叫做Google Hacking,其内容是使用Google搜索一些特定的关键词,以便找到有漏洞的、易被黑客攻击的站点。这个网站列出了这些关键词:Google Hacking Database (http://johnny.ihackstuff.com/index.php?module=prodreviews )
12. 在Google 中输入一组关键词时,默认是“与”搜索,就是搜索包含有所有关键词的网页。如果要“或”搜索,可以使用大写的[OR]或 [|],使用时要与关键词之间留有空格。比如搜索关键词[Hamlet (pizza | coke)],是让Google搜索页面中或页面链接描述中含有Hamlet,并含有pizza与coke两个关键词中任意一个的网页。
13. 并非所有的Google服务都支持相同的语法,比如在Google Group中支持 [insubject:test]之类的主题搜索。可以通过高级搜索来摸索这些关键词的用法:进入高级搜索之后设置搜索选项,然后观察关键字输入窗口中的关键字的变化
14. 有 时候Google懂得一些自然语言,比如搜索关键词[goog], [weather new york, ny], [new york ny]或[war of the worlds],此时Google会在搜索结果前显示出一个被业内称为“onebox”的结果,试试看吧!
15. 并 非所有的Google都是相同的,它因国家版本(或是说语言版本)而异。在US版下,搜索[site:stormfront.org]会有成千上万的结 果,而在德语版下,搜索[site:stormfront.org]的结果,嗯,自己看吧。Google的确与各国政府有内容审查协议,比如德国版,法国 版(网页搜索),中国版Google新闻
16. 有时候Google会提示你搜索结果很烂,比如你搜索关键词[jew]试试,Google会告诉你它给出的搜索结果很烂,然后给你一个解释:http://www.google.com/explanation.html
17. 以前,搜索某些关键词如[work at Google] 时会看到Google给自己打的广告。可以去http://www.google.com/jobs/了解Google的工作
18. 对 于一些“Googlebombed”(大概意思是指Google搜索的结果出问题了)的关键词,会有一个广告链接到:http: //googleblog.blogspot.com/2005/09/googlebombing-failure.html (中国大陆需要代理才能访问)。比如搜索[failure],第一条是美国布什总统介绍:)
19. 虽然现在Google还没有支持自然语言,但这里有一段录像显示了支持自然语言的搜索引擎的使用效果:http://blog.outer-court.com/videos/googlebrain.wmv
20. 有 人说在Google中搜索[president of the internet],其结第一条表明了president of the internet是谁,我也是这么认为的,而且你还可以使用这个logo支持本文作者:http://blog.outer- court.com/files/president.gif :)
21. Google现在不再有“stop words”(被强制忽略的关键词),比如搜索 [to be or not to be], Google返回的结果中间还列有相关的完整短语搜索结果
22. 在Google 计算器(http://www.google.com/help/features.html#calculator )中有个彩蛋:输入[what is the answer to life, the universe and everything?]时,会返回42。(关键词翻译过来的意思是指“生命、宇宙和一切的答案”,这是一个著名科幻小说中的情节,详情参见
http://en.wikipedia.org/wiki/The_Answer_to_Life,_the_Universe, _and_Everything)。试试吧,哈哈
23. 你可以在搜索时使用通配符[*],这在搜索诗词时特别有效。比如你可以搜一下["love you twice as much * oh love * *"] 试试
24. 同样,你的关键词可以全部都是通配符,比如搜索["* * * * * * *"]
25. www.googl.com是在输错网址后的结果,也是个搜索网站,但搜索结果与Google完全不同。而且此网站也赚Google的钱,因为它使用Google AdSense
26. 如 果你想把搜索结果限制在大学的网站之中,可以使用[site:.edu]关键词,比如[c-tutorial site:.edu],这样可以只搜索以edu结尾的网站。你也可以使用Google Scholar来达到这个目的。也可以使用[site:.de]或[site:.it]来搜索某个特定国家的网站

http://xyzboard.com/?q=node/49

18:10 发表在 Leisure | 查看全文 | 评论 (0)