SearchMob!


Powered by Rollyo

Recent Comment
Spotlight

  • Reader Yong writes: Regardless of this debate, google should be so intelligent as to take the necessary steps to prevent the abuse of its power. Instead of waiting for a disaster to happen, which may actually harm people. [go]

Recent Comments

  • Motoryzacja: " June 28 was few days ago and I still did ..." [go]
  • Motoryzacja: " June 28 was few days ago and I still did ..." [go]
  • gary price: " Btw, speaking of Nutch, the first two ma ..." [go]
  • Rex: " We (MSNBC.com) are a co-sponsor of this ..." [go]
  • kelebek: " ..." [go]
  • porno: " ..." [go]
  • porno: " hi ..." [go]
  • Jordan: " I talked to someone who interviewed ther ..." [go]
  • Matt Cox: " From what I see, it looks impressive - w ..." [go]
  • Sarah: " sure it sounds nice to get all the servi ..." [go]
  • jerryk: " Yeah, My 800 # got called from Merchantc ..." [go]
  • Brad Collins: " This isn't an Open Source story, it's an ..." [go]
  • Bev: " I, also received a call from them this a ..." [go]
  • Nancy: " Sunday morning they call me with the "ne ..." [go]
  • Lisa: " Yes, they are definitely “still at it”! ..." [go]
  • Joe Clark: " Only if you can't differentiate “Fundy” ..." [go]

PERFECT FOR THAT PERSON WITH EVERYTHING
Order 'The Search'

thesearch_bookcover.jpg

Yup, it makes the perfect gift for that officemate or colleague who you thought had everything....including you! If you order here, I promise to sign it, assuming we can figure out the shipping...

You can also buy the audio version here.

Check my book page for more info.

Blogger's Rights

Top Posts

Active Topics

Monthly Archives

About John Battelle

Searchblog Newsletter

Enter email to subscribe to "Re-Find", Searchblog's weekly newsletter:


Calendar

August 2007
Su Mo Tu We Th Fr Sa
      1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31  

Syndicate

Powered by

November 28, 2003 11:48 AM

The Search Papers: Defining Intent

I've just finished reading A Taxonomy of Web Search by Andrei Broder, written largely while the author was CTO of Alta Vista (and using AV query data), and published after he moved to IBM Research in 2001.

The paper has a trove of references to other papers, which is good for my work, and it has a singular thesis: that all web searches are not equal. Broder sets out to dispel the notion that all searches are "informational" in nature. He instead maintains that many are "transactional" or "navigational" in nature. These two seemingly obvious categories are in fact relatively new to the academic field of Information Retrieval (IR), which developed largely in the context of large islands of data (ie, in the 70s/80s), rather than in the web era.

What I like about this paper is the use of the word "intent" - which over the years I've come to use quite a bit (see my last column on video advertising over the internet, in which I rant once again on "intent over content", or my post on The Database of Intentions). Intent is behind every kind of search, Broder says, but "there is no assumption ... that this intent can be inferred with any certitude from the query." Ay, there's the rub....To get to that intent, Broder employed a short survey on the site.

A few fun facts from Broder's analysis of response and related log data:
- nearly 15% of searchers wish for "a good collection of links on a subject" as opposed to "a good document."
- 12% of queries in the log data used were sexual in nature
- nearly 25% of searchers were looking for "a specific website that I already had in mind."
- An estimated 36% of searchers were looking for transactional information - what Broder calls "the intent to perform some web-mediated activity."

Broder concludes that the next generation of search engines will need to take into account this new taxonomy of intent - transactions, navigation, as well as informational. Given that this paper was published in late 2001, it's interesting to see how the major engines already are on that path - with Yahoo's focus on shopping being one of the best examples.


TrackBack

Listed below are links to weblogs that reference The Search Papers: Defining Intent:

» Feedster hack: Where the heck did you see that post last week? from The Story of Feedster
According to Andrei Broder, 25% of web searches are looking for a specific URL, not general info. (Thanks to John Battelle for the link.)Oh, [Read More]

» A9 Search Goes Live... With The Attendent Privacy Issues from Napsterization
From Amazon. What they say it will do: Search Inside the Book: In addition to web search results we present book results from Amazon.com that include Search Inside the Book. When you see an excerpt on any of the book... [Read More]

Comments

I think that web taxonomy has changed because war between web search engines for a high $market, i will check yesterday store that have support for it devices as terminals Barcode Scanners Printers but i searching a solution to generate barcode tags from flash interface.

I RECEIVED A COPYRIGHT FOR MY POEM fOOTPRINTS IN THE SAND. I THINK IT WAS FOR APRIL 25TH, 1978 OR 1979. THE COPYRIGHT PEOPLE KEEP SAYING THEY CANNOT FIND IT. I DISTINCTLY REMEMBERING I RECEIVED IT. NEED HELP TO FIND IT. IT IS LIKE THE ARCH OF THE COVENANT STORED AWAY SOMEWHERE IN THE BOWELS OF THE LIBRARY OF CONGRESS. PLEASE HELP ME.

The interesting thing I get from those statistics is that people who search really don't know what the hell they really want (at least at the beginning). I think more and more searches are being geared towards generally educating the searcher as opposed to "getting the right answer now".

thank you for interesting info

If i search someting in Venezuela, i use google.co.ve but almost the most site about venezuela i found are from us companies, without any intres in the county only making money with adsense and co. US dominaz everywhere in this world, I shot the sheriff :)

An estimated 36% of searchers were looking for transactional information - what Broder calls "the intent to perform some web-mediated activity." im not sure that this is correct because, need more information statisticals. For example when a people search a term impresora pvc only wayback and another solution have the form to know the stats, because search engine drive more wanted text.

Post a comment

Human detector
Please enter the letter "q" in the field below. If you want to preview your comment before posting, enter the secret letter after previewing, not now, as the letter will change upon preview.

Enter the letter from above:

Searchblog Classifieds!

Recent Jobs

Searchblog, in paperback

Searchblog
Print Edition

Get Your Own Print Version of Searchblog

Get the book

Click here to buy a customized print version of the entire contents of Searchblog.

Categories

Search Resources

License