<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: What I&#039;m Proposing</title>
	<atom:link href="http://gashousegraphs.com/2010/10/30/what-im-proposing/feed/" rel="self" type="application/rss+xml" />
	<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/</link>
	<description></description>
	<lastBuildDate>Sun, 19 May 2013 03:52:38 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
	<item>
		<title>By: vivaelpujols</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-354</link>
		<dc:creator>vivaelpujols</dc:creator>
		<pubDate>Sun, 31 Oct 2010 18:03:36 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-354</guid>
		<description><![CDATA[Ok that makes sense, I&#039;m a little fuzzy on the statistics of it anyways.  I think that Chris might prefer to do a binary variable anyway, so if it&#039;s able to be manipulated then it should be fine.]]></description>
		<content:encoded><![CDATA[<p>Ok that makes sense, I&#8217;m a little fuzzy on the statistics of it anyways.  I think that Chris might prefer to do a binary variable anyway, so if it&#8217;s able to be manipulated then it should be fine.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stevesommer05</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-353</link>
		<dc:creator>stevesommer05</dc:creator>
		<pubDate>Sun, 31 Oct 2010 15:31:57 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-353</guid>
		<description><![CDATA[I believe the absolute co-linearity is if you use all of the sub-components for a dummy variable with multi categories (i.e. using the 4 dummy variables for a 4 category variable, you&#039;d actually use 3).

I think it&#039;s less applicable to having multiple separate variables.]]></description>
		<content:encoded><![CDATA[<p>I believe the absolute co-linearity is if you use all of the sub-components for a dummy variable with multi categories (i.e. using the 4 dummy variables for a 4 category variable, you&#8217;d actually use 3).</p>
<p>I think it&#8217;s less applicable to having multiple separate variables.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stevesommer05</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-352</link>
		<dc:creator>stevesommer05</dc:creator>
		<pubDate>Sun, 31 Oct 2010 15:13:51 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-352</guid>
		<description><![CDATA[Fair enough, you&#039;ve researched it more than I.  In the long run I don&#039;t really care one way or the other. 1-10 is fine with me.

I wasn&#039;t suggesting the scout was the SME in how to create the variables re: regression; just that they have a good feel for the levels of the variables.  Might as well express the variable in a way that matches the levels the scout already sees so they are apt to fill it out.]]></description>
		<content:encoded><![CDATA[<p>Fair enough, you&#8217;ve researched it more than I.  In the long run I don&#8217;t really care one way or the other. 1-10 is fine with me.</p>
<p>I wasn&#8217;t suggesting the scout was the SME in how to create the variables re: regression; just that they have a good feel for the levels of the variables.  Might as well express the variable in a way that matches the levels the scout already sees so they are apt to fill it out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: vivaelpujols</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-351</link>
		<dc:creator>vivaelpujols</dc:creator>
		<pubDate>Sun, 31 Oct 2010 14:53:06 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-351</guid>
		<description><![CDATA[The scout is not the subject matter expert regarding which way the expression of the variables is more conducive to running a regression.  That&#039;s us.

Again, I would think that running a regression with 5 (or however many categories TPG chooses) dummy variables would not produce useable results.  What I&#039;m reading online is that unless you excluded the intercept, running a regression with all dummy variables yields absolute multicollinearity.]]></description>
		<content:encoded><![CDATA[<p>The scout is not the subject matter expert regarding which way the expression of the variables is more conducive to running a regression.  That&#8217;s us.</p>
<p>Again, I would think that running a regression with 5 (or however many categories TPG chooses) dummy variables would not produce useable results.  What I&#8217;m reading online is that unless you excluded the intercept, running a regression with all dummy variables yields absolute multicollinearity.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stevesommer05</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-350</link>
		<dc:creator>stevesommer05</dc:creator>
		<pubDate>Sun, 31 Oct 2010 14:03:34 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-350</guid>
		<description><![CDATA[Yes, would use regression.  I think it would be fine with 4-5 binary (categorical style) variables.  I wouldn&#039;t be locked into that approach though.  If the subject matter expert (a scout) thought that the variables would be better expressed on a different scale then I would be fine with that too.

I like the binary nature mainly for the ease of interpretation of results.]]></description>
		<content:encoded><![CDATA[<p>Yes, would use regression.  I think it would be fine with 4-5 binary (categorical style) variables.  I wouldn&#8217;t be locked into that approach though.  If the subject matter expert (a scout) thought that the variables would be better expressed on a different scale then I would be fine with that too.</p>
<p>I like the binary nature mainly for the ease of interpretation of results.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: vivaelpujols</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-349</link>
		<dc:creator>vivaelpujols</dc:creator>
		<pubDate>Sun, 31 Oct 2010 06:00:31 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-349</guid>
		<description><![CDATA[Well I would assume you&#039;d want to run the thing in a regression.  How else would you tease out the multi-collinearity and measure the relative importance of each individual flaw?  Would a regression work if you regressed 4 or 5 binary variables on wOBA?

I agree with Valentine that this would be simpler and may reduce bias; however, I disagree that the subjectivity in deciding what&#039;s worse: 3-4, or 8-9, is a damning quality.  Isn&#039;t it much worse to assume that either a hitter has the flaw or doesn&#039;t?  That simplifies everything into a either perfect or terrible.

I would much rather have the 1-10 scale, all things being equal, but I suppose that&#039;s not really what Chris is going for in his analyses.]]></description>
		<content:encoded><![CDATA[<p>Well I would assume you&#8217;d want to run the thing in a regression.  How else would you tease out the multi-collinearity and measure the relative importance of each individual flaw?  Would a regression work if you regressed 4 or 5 binary variables on wOBA?</p>
<p>I agree with Valentine that this would be simpler and may reduce bias; however, I disagree that the subjectivity in deciding what&#8217;s worse: 3-4, or 8-9, is a damning quality.  Isn&#8217;t it much worse to assume that either a hitter has the flaw or doesn&#8217;t?  That simplifies everything into a either perfect or terrible.</p>
<p>I would much rather have the 1-10 scale, all things being equal, but I suppose that&#8217;s not really what Chris is going for in his analyses.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: stevesommer05</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-348</link>
		<dc:creator>stevesommer05</dc:creator>
		<pubDate>Sun, 31 Oct 2010 01:18:00 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-348</guid>
		<description><![CDATA[I&#039;d do it this way initially for a couple of reasons.  I think it lessens subjectivity some, and thus hopefully reduces bias.  Additionally I think it&#039;s easier to interpret the results if the variables are binary (you either have the flaw or don&#039;t.  How do you use the information of going from a 4 to 5 in leaky back elbow lowers your wOBA by 0.005 pts?  It just seems hard to understand the 1-10 scale.  That being said, I think I&#039;d lean toward letting the scout pick the scale.]]></description>
		<content:encoded><![CDATA[<p>I&#8217;d do it this way initially for a couple of reasons.  I think it lessens subjectivity some, and thus hopefully reduces bias.  Additionally I think it&#8217;s easier to interpret the results if the variables are binary (you either have the flaw or don&#8217;t.  How do you use the information of going from a 4 to 5 in leaky back elbow lowers your wOBA by 0.005 pts?  It just seems hard to understand the 1-10 scale.  That being said, I think I&#8217;d lean toward letting the scout pick the scale.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: valatan</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-347</link>
		<dc:creator>valatan</dc:creator>
		<pubDate>Sat, 30 Oct 2010 23:47:17 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-347</guid>
		<description><![CDATA[That would depend on how accurate the various thing are.  An ordinal variable would only really work if the differenc between a 3 and a 4 was the same as a difference between a 8 and a 9 , after all.]]></description>
		<content:encoded><![CDATA[<p>That would depend on how accurate the various thing are.  An ordinal variable would only really work if the differenc between a 3 and a 4 was the same as a difference between a 8 and a 9 , after all.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: vivaelpujols</title>
		<link>http://gashousegraphs.com/2010/10/30/what-im-proposing/#comment-346</link>
		<dc:creator>vivaelpujols</dc:creator>
		<pubDate>Sat, 30 Oct 2010 21:47:01 +0000</pubDate>
		<guid isPermaLink="false">http://playahardnine.wordpress.com/?p=1920#comment-346</guid>
		<description><![CDATA[It seems like binomial variables would make it hard to run a proper regression, why not just have him grade the degree of flaw on a 1-10 scale or something?]]></description>
		<content:encoded><![CDATA[<p>It seems like binomial variables would make it hard to run a proper regression, why not just have him grade the degree of flaw on a 1-10 scale or something?</p>
]]></content:encoded>
	</item>
</channel>
</rss>
