<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: What is different about highly effective retrievals?</title>
	<atom:link href="http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/feed/" rel="self" type="application/rss+xml" />
	<link>http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/</link>
	<description>Information Retrieval Research and Development</description>
	<lastBuildDate>Thu, 30 Jun 2011 14:19:08 -0400</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.4</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: miles</title>
		<link>http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/comment-page-1/#comment-23</link>
		<dc:creator>miles</dc:creator>
		<pubDate>Sun, 21 Sep 2008 20:01:21 +0000</pubDate>
		<guid isPermaLink="false">http://probablyirrelevant.org/?p=28#comment-23</guid>
		<description>Fernando-- I suspect you&#039;re right, and it is likely that the top-performing systems are retrieving unjudged relevant documents.  But by virtue of _being_ the best performers, they are also retrieving the judged relevant docs.  So it&#039;s easy for me to believe that there are some very good systems far down in the rankings (having had the bad luck to retrieve unjudged relevant docs).  But it&#039;s not clear to me how relevant documents that weren&#039;t judged due to pooling would hurt the relret for high performers.</description>
		<content:encoded><![CDATA[<p>Fernando&#8211; I suspect you&#8217;re right, and it is likely that the top-performing systems are retrieving unjudged relevant documents.  But by virtue of _being_ the best performers, they are also retrieving the judged relevant docs.  So it&#8217;s easy for me to believe that there are some very good systems far down in the rankings (having had the bad luck to retrieve unjudged relevant docs).  But it&#8217;s not clear to me how relevant documents that weren&#8217;t judged due to pooling would hurt the relret for high performers.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Fernando</title>
		<link>http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/comment-page-1/#comment-22</link>
		<dc:creator>Fernando</dc:creator>
		<pubDate>Sun, 21 Sep 2008 15:48:54 +0000</pubDate>
		<guid isPermaLink="false">http://probablyirrelevant.org/?p=28#comment-22</guid>
		<description>A few explanations for why your top runs may have low relret

pooling depth.  compute metric for the pooling depth.  there could be unjudged relevant documents.  automatic runs may be saved because their unjudged documents are more likely to be within in the pooling depth of another run.
manual runs may be high precision, as opposed to high recall and we can certainly have methods with high precision which are low recall.  
</description>
		<content:encoded><![CDATA[<p>A few explanations for why your top runs may have low relret</p>
<p>pooling depth.  compute metric for the pooling depth.  there could be unjudged relevant documents.  automatic runs may be saved because their unjudged documents are more likely to be within in the pooling depth of another run.<br />
manual runs may be high precision, as opposed to high recall and we can certainly have methods with high precision which are low recall.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Fernando</title>
		<link>http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/comment-page-1/#comment-21</link>
		<dc:creator>Fernando</dc:creator>
		<pubDate>Sun, 21 Sep 2008 15:48:28 +0000</pubDate>
		<guid isPermaLink="false">http://probablyirrelevant.org/?p=28#comment-21</guid>
		<description>Before starting a discussion about performance prediction the following questions need to be answered,

 why are you interested in performance prediction?  do you want to merge results?  abandon (or select) a ranking system for the query?  abandon (or select) a ranking a system system for all queries?  not all of these suggest the optimizing the same metric.
do you really not have relevance information?  performance predictors often provide relevance surrogates (e.g. frequently retrieved documents across systems).  does your system—from cold-start or warmed-up—have other sources of relevance surrogates?  
</description>
		<content:encoded><![CDATA[<p>Before starting a discussion about performance prediction the following questions need to be answered,</p>
<p> why are you interested in performance prediction?  do you want to merge results?  abandon (or select) a ranking system for the query?  abandon (or select) a ranking a system system for all queries?  not all of these suggest the optimizing the same metric.<br />
do you really not have relevance information?  performance predictors often provide relevance surrogates (e.g. frequently retrieved documents across systems).  does your system—from cold-start or warmed-up—have other sources of relevance surrogates?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jon Elsas</title>
		<link>http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/comment-page-1/#comment-20</link>
		<dc:creator>Jon Elsas</dc:creator>
		<pubDate>Sat, 20 Sep 2008 04:27:35 +0000</pubDate>
		<guid isPermaLink="false">http://probablyirrelevant.org/?p=28#comment-20</guid>
		<description>Brendan -- thanks for the links -- great posts.

Judgement-free evaluation has utility beyond absolute system assessment.  In meta search, or unsupervised rank aggregation, we often want to know which system&#039;s output is more reliable at query time in order to inform how results are merged.</description>
		<content:encoded><![CDATA[<p>Brendan &#8212; thanks for the links &#8212; great posts.</p>
<p>Judgement-free evaluation has utility beyond absolute system assessment.  In meta search, or unsupervised rank aggregation, we often want to know which system&#8217;s output is more reliable at query time in order to inform how results are merged.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brendan O'Connor</title>
		<link>http://probablyirrelevant.org/2008/09/what-is-different-about-highly-effective-retrievals/comment-page-1/#comment-19</link>
		<dc:creator>Brendan O'Connor</dc:creator>
		<pubDate>Fri, 19 Sep 2008 22:46:33 +0000</pubDate>
		<guid isPermaLink="false">http://probablyirrelevant.org/?p=28#comment-19</guid>
		<description>Why not always use human-judged relevance information?  On Amazon Mech. Turk you can get hundreds of relevance judgments for a dollar.

Sorry to promote my own posts, but here&#039;s the evidence...

http://blog.doloreslabs.com/2008/09/amt-fast-cheap-good-machine-learning/
http://blog.doloreslabs.com/2008/04/search-engine-relevance-an-empirical-test/</description>
		<content:encoded><![CDATA[<p>Why not always use human-judged relevance information?  On Amazon Mech. Turk you can get hundreds of relevance judgments for a dollar.</p>
<p>Sorry to promote my own posts, but here&#8217;s the evidence&#8230;</p>
<p><a href="http://blog.doloreslabs.com/2008/09/amt-fast-cheap-good-machine-learning/" rel="nofollow">http://blog.doloreslabs.com/2008/09/amt-fast-cheap-good-machine-learning/</a><br />
<a href="http://blog.doloreslabs.com/2008/04/search-engine-relevance-an-empirical-test/" rel="nofollow">http://blog.doloreslabs.com/2008/04/search-engine-relevance-an-empirical-test/</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

