<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Probably Irrelevant &#187; Fernando</title>
	<atom:link href="http://probablyirrelevant.org/author/diazf/feed/" rel="self" type="application/rss+xml" />
	<link>http://probablyirrelevant.org</link>
	<description>Information Retrieval Research and Development</description>
	<lastBuildDate>Thu, 15 Jul 2010 20:43:14 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>&#8220;Economic Impact Assessment of NIST’s Text REtrieval Conference (TREC) Program&#8221;</title>
		<link>http://probablyirrelevant.org/2010/07/economic-impact-assessment-of-nist%e2%80%99s-text-retrieval-conference-trec-program/</link>
		<comments>http://probablyirrelevant.org/2010/07/economic-impact-assessment-of-nist%e2%80%99s-text-retrieval-conference-trec-program/#comments</comments>
		<pubDate>Thu, 15 Jul 2010 20:43:14 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Conferences]]></category>
		<category><![CDATA[Evaluation]]></category>
		<category><![CDATA[Web Search]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=84</guid>
		<description><![CDATA[Thanks to your feedback,
&#8220;&#8230;this study estimates that TREC’s existence was responsible for approximately one-third of an improvement of more than 200% in web search products that was observed between 1999 and 2009.&#8221;
More here.
]]></description>
			<content:encoded><![CDATA[<p>Thanks to <a href="http://probablyirrelevant.org/2010/02/trec-survey/">your feedback</a>,</p>
<blockquote><p>&#8220;&#8230;this study estimates that TREC’s existence was responsible for approximately one-third of an improvement of more than 200% in web search products that was observed between 1999 and 2009.&#8221;</p></blockquote>
<p>More <a href="http://trec.nist.gov/pubs/2010.economic.impact.pdf">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2010/07/economic-impact-assessment-of-nist%e2%80%99s-text-retrieval-conference-trec-program/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SIGIR 2010 Best Paper Nominees</title>
		<link>http://probablyirrelevant.org/2010/07/sigir-2010-best-paper-nominees/</link>
		<comments>http://probablyirrelevant.org/2010/07/sigir-2010-best-paper-nominees/#comments</comments>
		<pubDate>Sat, 03 Jul 2010 22:18:57 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Conferences]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=82</guid>
		<description><![CDATA[SIGIR has posted best paper nominees.

A comparison of general vs personalized affective models for the prediction of topical relevance, I. Arapakis, K. Athanasakos, J. Jose
Assessing the Scenic Route: Measuring the Value of Search Trails in Web Logs, R. White, J. Huang
Caching Search Engine Results over Incremental Indices, F. Junqueira, R. Blanco, E. Bortnikov, R. Lempel, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.sigir2010.org/doku.php?id=program:awards">SIGIR has posted best paper nominees.</a></p>
<ul>
<li>A comparison of general vs personalized affective models for the prediction of topical relevance, I. Arapakis, K. Athanasakos, J. Jose</li>
<li>Assessing the Scenic Route: Measuring the Value of Search Trails in Web Logs, R. White, J. Huang</li>
<li>Caching Search Engine Results over Incremental Indices, F. Junqueira, R. Blanco, E. Bortnikov, R. Lempel, L. Telloli, H. Zaragoza</li>
<li>Comparing the Sensitivity of Information Retrieval Metrics, F. Radlinski, N. Craswell</li>
<li>Extending Average Precision to Graded Relevance Judgments, S. Robertson, E. Kanoulas, E. Yilmaz</li>
<li>Information Based Model for ad hoc information retrieval, S. Clinchant, E. Gaussier</li>
<li>Multi-style language model for web scale information retrieval, K. Wang, J. Gao, X. Li</li>
<li>Properties of Optimally Weighted Data Fusion in CBMIR, P. Wilkins, A. Smeaton</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2010/07/sigir-2010-best-paper-nominees/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Query logs and information retrieval research</title>
		<link>http://probablyirrelevant.org/2010/06/query-logs-and-information-retrieval-research/</link>
		<comments>http://probablyirrelevant.org/2010/06/query-logs-and-information-retrieval-research/#comments</comments>
		<pubDate>Wed, 02 Jun 2010 01:59:05 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Evaluation]]></category>
		<category><![CDATA[Web Search]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=76</guid>
		<description><![CDATA[About one year ago,  Bruce Croft asked the IR community for help with getting access to query logs for academia,
The goal of this project is to create a database of web search activity that will be provided to the information retrieval research community to use on current and future information retrieval research projects.
To accomplish [...]]]></description>
			<content:encoded><![CDATA[<p>About one year ago,  Bruce Croft asked the IR community for help with getting access to query logs for academia,</p>
<blockquote><p>The goal of this project is to create a database of web search activity that will be provided to the information retrieval research community to use on current and future information retrieval research projects.</p></blockquote>
<p>To accomplish this, the Lemur Project developed a toolbar to be voluntarily installed by users.  After a year of data collection, <a href="http://lemurstudy.cs.umass.edu/">the project has been aborted</a>,</p>
<blockquote><p>Given that we have gathered the equivalent of less than 6 seconds of Google traffic (assuming 500 million queries per day) in one year, we have decided to terminate the project.</p></blockquote>
<p>This is pretty depressing news.  Admittedly, part of this depression originates from my guilt over not having contributed to the project myself.  However, a more substantial part stems from the potential this data set had to be groundbreaking, perhaps similar to the release of the first Tipster collections.  Although this was way before my time, I imagine the sudden release of a large, public corpus resulted in a tremendous amount of activity and excitement.</p>
<p>Information retrieval research has had large collections of documents for a few decades now.  We evaluate on a few hundred queries and publish results.  With some exceptions, the majority of interest in the field has focused on scaling up corpora.  As a result, we have rich set of tools to analyze and retrieve documents from large corpora.</p>
<p>There are two things missing from this model: a rich stream of queries coming into the system and a rich stream of interactions between users and documents.  Our friends in the CHI and information science communities have been doing a great job with understanding the important factors involved in user behavior on laboratory scale.  However, I&#8217;m going to draw an analogy here between small scale user studies for IR and document-level NLP analysis for IR that may raise a few eyebrows.  I believe that many IR researchers would argue that, given the choice between a corpus-driven approaches and NLP approaches to IR, they would opt for more data.  This is despite the rich analysis NLP can provide.  Similarly, I believe that the fine-grained analysis provided by laboratory studies may be less important than very large scale analysis of user behavior.  Of course, both the results about NLP for IR and the claim about laboratory experiments are based on relatively limited experiments (e.g. small sets of queries).  We should, as a community, continue research in all of these directions.</p>
<p>Having said this, let&#8217;s consider some motivations for web query logs and IR research,</p>
<p><strong>Claim 1. Web query logs will help with the contribution to web search research.</strong></p>
<p>There is no doubt that query logs are important for any search engine, web or otherwise.  However, query logs are only one of the many sources of interaction data available in production.  There are many, many other signals which can be effectively exploited for query understanding and document ranking.   In my opinion, outside of starting its own web search engine, academia will always be scurrying to catchup to industry&#8217;s data sources.</p>
<p>I convinced myself a few years ago that the resources required to build and maintain a web search engine may never exist in academia.  This is not to say that academic IR researchers should give up on having impact on web search engines.  IR research several decades old continues to impact modern search engine design.  What needs to be determined is how the current academic IR researchers can more directly address the problems confronted by web search companies.  I personally believe that a tight coupling between academic and industrial research labs needs to exist.  This could be accomplished in a number of ways.</p>
<ol>
<li> add value to an existing search engine&#8217;s interface.  If search engines provide ranker APIs, academics can develop new interfaces which may attract users and, as a result, interaction data.</li>
<li> teach the IR fundamentals during the academic year/perform intense interaction during the summer during internships or other collaborations.  I am most familiar with Yahoo&#8217;s <a href="http://labs.yahoo.com/ksc">Key Scientific Challenges Fellowships</a> and <a href="http://labs.yahoo.com/Academic_Relations/Faculty">Faculty Engagement Grants</a>.  Similar programs exist at other web search engines.</li>
<li> develop high-quality, public web search engine simulators which provide students/researchers with the ability to test algorithms <em>in silico</em>.  Our <a href="http://ciir.cs.umass.edu/~fdiaz/sigir09-DA.pdf">SIGIR 2009 paper</a> made extensive use of simulation whose parameters were grounded in real world data.  Systems research in computer architecture or computer networking have adopted this approach for a while.  SIGIR 2010 will be hosting a workshop on <a href="http://www.dcs.gla.ac.uk/access/simint/">simulated interaction</a>.</li>
</ol>
<p>No doubt there are many, many other alternatives.</p>
<p><strong>Claim 2. Web query logs will help with the contribution to production search research.</strong></p>
<p>As stated earlier, IR research has looked at the document side for many, many problems.  This research has benefited web search as well as search in other domains such as legal, news, and enterprise search.</p>
<p>User behavior data improved production web search engines; user behavior data will no doubt improve production non-web search engines.   Just as with web search though, this data does not exist in academia.</p>
<p>I believe, though, that the barrier to entry for non-web/vertical search engines is somewhat lower.  The collections are smaller and manageable.  At the same time, document representations can be richer for verticals, interaction is less constrained, and, as a result, the potential for attracting users may be higher than with portal web search engines.</p>
<p>If an academic institution maintained a domain-specific production search engine, academic research could become more relevant to industrial search engines.  For example, academic institutions would easily be able to publish about query logs, interaction, large scale adaptation, and online learning with large scale real world data.  One important, unresolved question is how to come to terms with experimental reproducibility and production data which is often closed due to privacy reasons.</p>
<p>Academic IR research will continue to contribute to general IR research.  Students trained in IR fundamentals will continue to be strong candidates for research and development in production search companies.  I believe that there is room for greater impact.  How that happens remains to be seen.</p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2010/06/query-logs-and-information-retrieval-research/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Vintage Cornell/SMART Tech Reports?</title>
		<link>http://probablyirrelevant.org/2010/05/vintage-cornellsmart-tech-reports/</link>
		<comments>http://probablyirrelevant.org/2010/05/vintage-cornellsmart-tech-reports/#comments</comments>
		<pubDate>Wed, 05 May 2010 19:33:56 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=72</guid>
		<description><![CDATA[A few years ago, someone published an online interface to many old Cornell/SMART tech reports from the Salton group.  Unfortunately, I cannot seem to find them anywhere now.  Who can help correct this irony?
Update: Here is the SIGIR Digital Museum of Information Retrieval Research, including those SMART reports.
]]></description>
			<content:encoded><![CDATA[<p>A few years ago, someone published an online interface to many old Cornell/SMART tech reports from the Salton group.  Unfortunately, I cannot seem to find them anywhere now.  Who can help correct this irony?</p>
<p><strong>Update:</strong> Here is the <a href="http://www.sigir.org/museum/">SIGIR Digital Museum of Information Retrieval Research</a>, including those SMART reports.</p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2010/05/vintage-cornellsmart-tech-reports/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>TREC Survey</title>
		<link>http://probablyirrelevant.org/2010/02/trec-survey/</link>
		<comments>http://probablyirrelevant.org/2010/02/trec-survey/#comments</comments>
		<pubDate>Mon, 22 Feb 2010 17:52:18 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Conferences]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=69</guid>
		<description><![CDATA[There is a survey being conducted about the impact of TREC on information retrieval research.  This feedback is important for organizers and I encourage researchers to participate.  If you are outside of the IR community and have ever used TREC collections, this is feedback is also valuable.
]]></description>
			<content:encoded><![CDATA[<p>There is <a href="https://trecsurvey.rti.org/Default.aspx">a survey being conducted about the impact of TREC on information retrieval research</a>.  This feedback is important for organizers and I encourage researchers to participate.  If you are outside of the IR community and have ever used TREC collections, this is feedback is also valuable.</p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2010/02/trec-survey/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>SIGIR 2009 ACCEPTED PAPERS THREAD</title>
		<link>http://probablyirrelevant.org/2009/04/sigir-2009-accepted-papers-thread/</link>
		<comments>http://probablyirrelevant.org/2009/04/sigir-2009-accepted-papers-thread/#comments</comments>
		<pubDate>Thu, 09 Apr 2009 14:04:09 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Conferences]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=55</guid>
		<description><![CDATA[SIGIR Poster decisions have been mailed.  Full paper decisions should be soon as well.  Authors are encouraged to post preprints/drafts of accepted publications in the comments section.
PAPERS
The program committee reviewed 494 full paper submissions and accepted 78, about a 16% acceptance rate.
In 2008, 497 submitted, 85 accepted, about a 17% acceptance rate.
POSTERS
The program [...]]]></description>
			<content:encoded><![CDATA[<p>SIGIR Poster decisions have been mailed.  Full paper decisions should be soon as well.  Authors are encouraged to post preprints/drafts of accepted publications in the comments section.</p>
<p><strong>PAPERS</strong></p>
<p>The program committee reviewed 494 full paper submissions and accepted 78, about a 16% acceptance rate.</p>
<p>In 2008, 497 submitted, 85 accepted, about a 17% acceptance rate.</p>
<p><strong>POSTERS</strong></p>
<p>The program committee reviewed 256 poster submissions and accepted 86, about a 34% acceptance rate.</p>
<p>In 2008, 173 submitted, 91 accepted, about a 53% acceptance rate.</p>
<p><strong>UPDATE</strong></p>
<p>Accepted papers <a href="http://www.sigir2009.org/Program/papers">published</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2009/04/sigir-2009-accepted-papers-thread/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>SIGIR 2009 Information</title>
		<link>http://probablyirrelevant.org/2008/12/sigir-2009-information/</link>
		<comments>http://probablyirrelevant.org/2008/12/sigir-2009-information/#comments</comments>
		<pubDate>Wed, 10 Dec 2008 21:37:12 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Conferences]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=51</guid>
		<description><![CDATA[Conference URLs:

SIGIR 2009 Homepage
SIGIR 2009 Facebook Event

Important Dates:



Jan 19, 2009
Abstracts for full research papers due


Jan 26, 2009
Full research paper submissions due


Feb 2, 2009
Workshop proposals due


Feb 23, 2009
Posters, demonstration, and tutorial proposals due


Mar 2, 2009
Doctoral consortium proposals due


Mar 9, 2009
Notification of workshop acceptances


Apr 11, 2009
All other acceptance notification



]]></description>
			<content:encoded><![CDATA[<p>Conference URLs:</p>
<ul>
<li><a href="http://sigir2009.org">SIGIR 2009 Homepage</a></li>
<li><a href="http://www.facebook.com/home.php#/event.php?eid=42368171077">SIGIR 2009 Facebook Event</a></li>
</ul>
<p>Important Dates:</p>
<table border="0">
<tbody>
<tr>
<td>Jan 19, 2009</td>
<td>Abstracts for full research papers due</td>
</tr>
<tr>
<td>Jan 26, 2009</td>
<td>Full research paper submissions due</td>
</tr>
<tr>
<td>Feb 2, 2009</td>
<td>Workshop proposals due</td>
</tr>
<tr>
<td>Feb 23, 2009</td>
<td>Posters, demonstration, and tutorial proposals due</td>
</tr>
<tr>
<td>Mar 2, 2009</td>
<td>Doctoral consortium proposals due</td>
</tr>
<tr>
<td>Mar 9, 2009</td>
<td>Notification of workshop acceptances</td>
</tr>
<tr>
<td>Apr 11, 2009</td>
<td>All other acceptance notification</td>
</tr>
</tbody>
</table>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2008/12/sigir-2009-information/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>WSDM Accepted Papers Posted</title>
		<link>http://probablyirrelevant.org/2008/10/wsdm-accepted-papers-posted/</link>
		<comments>http://probablyirrelevant.org/2008/10/wsdm-accepted-papers-posted/#comments</comments>
		<pubDate>Tue, 28 Oct 2008 16:39:57 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Conferences]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=34</guid>
		<description><![CDATA[WSDM 2009 accepted papers posted here.
]]></description>
			<content:encoded><![CDATA[<p>WSDM 2009 accepted papers posted <a href="http://www.wsdm2009.org/papers.php">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2008/10/wsdm-accepted-papers-posted/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Blogs, queries, corpora</title>
		<link>http://probablyirrelevant.org/2008/09/blogs-queries-corpora/</link>
		<comments>http://probablyirrelevant.org/2008/09/blogs-queries-corpora/#comments</comments>
		<pubDate>Thu, 11 Sep 2008 16:59:08 +0000</pubDate>
		<dc:creator>Fernando</dc:creator>
				<category><![CDATA[Blog Search]]></category>

		<guid isPermaLink="false">http://probablyirrelevant.org/?p=9</guid>
		<description><![CDATA[In 2006, I was studying information retrieval at the University of Massachusetts and, during a Friday of extreme impatience, I installed WordPress, started apached and created a blog called “Information Retrieval”.  After a handful of posts over the course of six months, the comments queue filled with spam and WordPress stopped working.  It [...]]]></description>
			<content:encoded><![CDATA[<p>In 2006, I was studying information retrieval at the University of Massachusetts and, during a Friday of extreme impatience, I installed WordPress, started apached and created a blog called “Information Retrieval”.  After a handful of posts over the course of six months, the comments queue filled with spam and WordPress stopped working.  It is with this dubious evidence that I have been asked my esteemed colleagues to write the first post of “Probably Irrelevant”.  The talent represented by those nominating me will ensure that “Probably Irrelevant” will see a little more life than “Information Retrieval” (if it has not already based on the title alone).</p>
<p>Now, it seems appropriate that the inaugural post of an information retrieval blog should address the subject of “blog search”.  Unfortunately, I am dreadfully less qualified than my co-authors to discuss the state of the art.  So, I apologize in advance for errors, omissions, or general ridiculousness and lay blame on Kevyn and Jonathan.</p>
<p>Now, when I started “Information Retrieval”, one of the first messages I received was from a senior member of the IR community.  He wrote,</p>
<blockquote><p>Maybe you could blog about why anyone is interested in blogs :-)</p></blockquote>
<p>I replied,</p>
<blockquote><p>I&#8217;ll keep this in mind when you&#8217;re chairing a session on blog search at SIGIR 2010.</p></blockquote>
<p>I will not identify the original commenter but encourage conference attendees to pay attention in Geneva.</p>
<p>Of course, this comment deserves some thought.  One of the issues with blog search is the under-defined taxonomy of queries.  The TREC Blog Track defines the following tasks</p>
<ul>
<li>blog post retrieval (i.e. “Find me posts about X.”)</li>
<li>opinion retrieval (i.e. “What do people think about X?”)</li>
<li>polarity (i.e. “Find me positive posts about X.”)</li>
<li>feed distillation (i.e. “Find me a blog with a principle, recurring interest in X.”)</li>
</ul>
<p>One question I hope will be resolved in the comments is where these query types came from.  Are they derived from actual blog searchers?   Or are they merely contrived by the track organizers while trading pints at the Gaithersburg Marriot?  These are questions, not criticisms. I think these are fine tasks but we have to be careful to define queries which are representative of those being issued blog search engines or, more generally, fulfill some desire users have.  The problem with a new corpus is that how users interact with it is still not completely developed.  What users will actually use these systems?  Casual blog readers?  Marketers?  Political scientists?  Sociologists?</p>
<p>The majority of time in an “Introduction to Information Retrieval” course is devoted to modeling documents.  And, yes, we have sophisticated models of documents. We decompose individual documents using passages, sentences, or other exploitable structure.  We also model the corpus as a whole either explicitly (e.g. cluster-based retrieval, latent semantic indexing, regularization) or implicitly (e.g. pseudo-relevance feedback).</p>
<p>For an information retrieval researcher, a corpus without queries is a corpse.  Queries make information retrieval different from unsupervised learning.  Also, because they are so short, queries make information retrieval different from traditional text classification.  While information retrieval research has focused on ranking documents given a query, prior to the late 1990s, there were very few (published) results on modeling queries in aggregate.  However, with the advent of web search engines, there has been a growing body of work on such models.  These include descriptive studies of web query frequencies and user clicking behavior as well as models for query similarity and clicking behavior.  These results have mainly been presented for web users and queries; I would be very interested in seeing whether the results generalize to non-web search scenarios.</p>
<p>To come back to blog search, I believe we need a better understanding of both the corpus and the queries before defining tasks.  Blog corpora exist and are actively being studied.  I am less certain about blog queries.  One approach would be to inspect query logs to blog search engines for different retrieval scenarios and then improve performance for those scenarios.  Of course, some of us are engineers who sometimes desire to build a tool because we believe it would be used.  However, if there is a mismatch between what we believe will be useful and what users find useful, then we have wasted time.*</p>
<p>I’ve touched on a lot in this first post and hope it serves as a starting point of discussion.  So, welcome to &#8220;Probably Irrelevant&#8221;.</p>
<p>*I just became aware of a paper to be presented at CIKM entitled “What Should Blog Search Look Like?” which I hope will answer some of these questions.</p>
<p><em>Editor&#8217;s Note:</em><em> Many thanks to Fernando for authoring our first post.  He couldn&#8217;t have chosen a more timely topic, the TREC 2008 Blog Track judgements are underway, <a href="http://terrierteam.blogspot.com/2008/09/about-blog-search-tasks.html">Iadh Ounis as recently posted a call for suggestions for the 2009 tasks</a>, <a href="http://www.searchenginecaffe.com/2008/09/trec-2009-blog-track-thoughts.html">Jeff Dalton has an insightful response</a>, and <a href="http://people.ischool.berkeley.edu/%7Ehearst/papers/blogsearch08.pdf">Marti Hearst&#8217;s paper is now online</a>.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://probablyirrelevant.org/2008/09/blogs-queries-corpora/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
