<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>sixsigmaZ &#187; Six Sigma Resources Online</title>
	<atom:link href="http://sixsigmaz.com/topics/spc/anova/feed/" rel="self" type="application/rss+xml" />
	<link>http://sixsigmaz.com</link>
	<description>Six Sigma Resources Online</description>
	<lastBuildDate>Thu, 27 May 2010 09:51:57 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
<meta xmlns="http://www.w3.org/1999/xhtml" name="robots" content="noindex,follow" />
		<item>
		<title>Download the Six Sigma Handbook Third Edition</title>
		<link>http://sixsigmaz.com/2009/11/23/download-the-six-sigma-handbook-third-edition/</link>
		<comments>http://sixsigmaz.com/2009/11/23/download-the-six-sigma-handbook-third-edition/#comments</comments>
		<pubDate>Mon, 23 Nov 2009 11:46:45 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[Books]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[Six Sigma News]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[customer requirements]]></category>
		<category><![CDATA[innovative products]]></category>
		<category><![CDATA[lean six sigma]]></category>
		<category><![CDATA[management responsibilities]]></category>
		<category><![CDATA[practice of quality management]]></category>
		<category><![CDATA[quality management strategy]]></category>
		<category><![CDATA[six sigma methodology]]></category>
		<category><![CDATA[statistical techniques]]></category>
		<category><![CDATA[streamlined format]]></category>
		<category><![CDATA[successful project management]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=829</guid>
		<description><![CDATA[The Six Sigma Handbook, Third Edition is the only comprehensive reference you need to make Six Sigma work for your company. The book explains how to organize for Six Sigma, how to use customer requirements to drive strategy and operations, how to carry out successful project management, and more. The authoritative classic&#8211;revised and updated for [...]]]></description>
			<content:encoded><![CDATA[<p>The Six Sigma Handbook, Third Edition is the only comprehensive reference you need to make Six Sigma work for your company. The book explains how to organize for Six Sigma, how to use customer requirements to drive strategy and operations, how to carry out successful project management, and more.</p>
<p>The authoritative classic&#8211;revised and updated for today&#8217;s Six Sigma practitioners</p>
<p>Whether you want to further your Six Sigma training to achieve a Black or Green Belt or you are totally new to the quality-management strategy, you need reliable guidance. The Six Sigma Handbook, ThirdEdition shows you, step by step, how to integrate this profitable approach into your company&#8217;s culture. </p>
<p>Co-written by an award-winning contributor to the practice of quality management and a successful Six Sigma trainer, this hands-on guide features:<br />
- Cutting-edge, Lean Six Sigma concepts integrated throughout<br />
- Completely revised material focused on project objectives<br />
- Updated and expanded problem-solving examples using Excel and Minitab<br />
- A streamlined format that puts proven practices at your fingertips</p>
<p>The Six Sigma Handbook, Third Edition is the only comprehensive reference you need to make Six Sigma work for your company. The book explains how to organize for Six Sigma, how to use customer requirements to drive strategy and operations, how to carry outsuccessful project management, and more. Learn all the management responsibilities and actions necessary for a successful deployment, as well as how to:<br />
- Dramatically improve products and processes using DMAIC and DMADV<br />
- Use Design for Six Sigma to create innovative products and processes<br />
- Incorporate lean, problem-solving, and statistical techniques within the Six Sigma methodology<br />
- Avoid common pitfalls during implementation</p>
<p>Six Sigma has evolved with the changing global economy, and The Six Sigma Handbook, Third Edition is your key to ensuring that your company realizes significant gains in quality, productivity, and sales in today&#8217;s business climate.</p>
<p>560 pages<div id="attachment_830" class="wp-caption alignright" style="width: 330px"><a href="http://sixsigmaz.com/wp-content/uploads/2009/11/sixsigmahandbookthirdedition.png" rel="lightbox[829]"><img src="http://sixsigmaz.com/wp-content/uploads/2009/11/sixsigmahandbookthirdedition.png" alt="Six Sigma Handbook Third Edition" title="sixsigmahandbookthirdedition" width="320" height="407" class="size-full wp-image-830" /></a><p class="wp-caption-text">Six Sigma Handbook Third Edition</p></div><br />
McGraw-Hill Professional;<br />
3 edition (September 21, 2009)<br />
ISBN: 0071623388<br />
PDF 9.6Mb</p>
<p>Download it from http://rapidshare.com/files/310019896/McGraw-Hill_-_The_Six_Sigma_Handbook_3rd_Edition__2009_.zip or </p>
<p>http://www.10xdownloads.com/servers.asp?pb=2&#038;PID=2bc60e94-1b70-4fdc-9f6f-d525c70aaf4f&#038;jstyle=3&#038;q=The%20Six%20Sigma%20Handbook,%203rd%20Edition</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2010/05/27/how-to-use-value-stream-mapping-to-select-lean-six-sigma-projects/">How To Use Value Stream Mapping To Select Lean Six Sigma Projects</a> <span class="related-post-date timestamp">Thu 27 May 2010</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/01/minitabs-latest-quality-companion-software-update-adds-value-stream-mapping/">Minitab's Latest Quality Companion Software Update Adds Value Stream Mapping</a> <span class="related-post-date timestamp">Thu 01 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2010/05/25/genpact-recognized-among-top-100-global-supply-chain-service-providers/">Genpact Recognized Among Top 100 Global Supply Chain Service Providers</a> <span class="related-post-date timestamp">Tue 25 May 2010</span></li><li class="related-post"><a href="http://sixsigmaz.com/2010/05/18/top-paying-jobs-in-six-sigma/">Top Paying Jobs in Six Sigma</a> <span class="related-post-date timestamp">Tue 18 May 2010</span></li><li class="related-post"><a href="http://sixsigmaz.com/2010/05/18/six-sigma-and-small-business-%e2%80%93-a-winning-combination/">Six Sigma and Small Business – A Winning Combination</a> <span class="related-post-date timestamp">Tue 18 May 2010</span></li><li class="related-post"><a href="http://sixsigmaz.com/2010/05/17/understanding-the-role-of-communication-in-the-process-of-six-sigma-certification/">Understanding the Role of Communication in the Process of Six Sigma Certification</a> <span class="related-post-date timestamp">Mon 17 May 2010</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/11/23/download-the-six-sigma-handbook-third-edition/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Six Sigma and Minitab A complete Toolbox Guide for All Six Sigma Practitioners</title>
		<link>http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/</link>
		<comments>http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/#comments</comments>
		<pubDate>Wed, 11 Nov 2009 11:29:08 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[Books]]></category>
		<category><![CDATA[DMAIC]]></category>
		<category><![CDATA[DOE]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[Black Belt]]></category>
		<category><![CDATA[blackbelts]]></category>
		<category><![CDATA[Green Belt]]></category>
		<category><![CDATA[minitab]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=669</guid>
		<description><![CDATA[Six Sigma and Minitab: A complete toolbox guide for all Six Sigma practitioners (2nd edition) by Quentin Brook In completing my final week of black belt training, my esteemed master black belt, George Woodley.  Thanks George. Description The long awaited 2nd edition of this bestselling pocket guide on Six Sigma contains a range of new [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_670" class="wp-caption alignright" style="width: 260px"><a href="http://sixsigmaz.com/wp-content/uploads/2009/11/six-sigma-and-minitab.png" rel="lightbox[669]"><img class="size-full wp-image-670" title="six-sigma-and-minitab" src="http://sixsigmaz.com/wp-content/uploads/2009/11/six-sigma-and-minitab.png" alt="Six Sigma and Minitab" width="250" height="164" /></a><p class="wp-caption-text">Six Sigma and Minitab</p></div>
<p>Six Sigma and Minitab: A complete toolbox guide for all Six Sigma practitioners (2nd edition) by Quentin Brook</p>
<p>In completing my final week of black belt training, my esteemed master black belt, George Woodley.  Thanks George.</p>
<p><strong>Description</strong></p>
<p>The long awaited 2nd edition of this bestselling pocket guide on Six Sigma contains a range of new tools and techniques including new Lean material and Improve tools. In addition it has been updated for Minitab 15 (whilst still compatible with versions 13 and 14). Cutting through Six Sigma s strange terminology and consultancy speak, this guide delivers Six Sigma in a down to earth, logical and user friendly format. Minitab: For each tool, this guide details how to enter the data into Minitab, interpret the results and avoid the common pitfalls. Interactive: All the data files and templates are available online. Routemaps: A logical flow is provided through each DMAIC phase. Who should use this book? Six Sigma Trainees: Both during and after training, this guide provides an invaluable reference text to those who are actually implementing Lean Six Sigma improvement projects. Six Sigma Project Sponsors and Managers: For those who are accountable for deploying Six Sigma or sponsoring Six Sigma projects, and who might not have been fully Six Sigma trained, this guide will provide an overview of the tools and techniques that your project teams are trained in.</p>
<p><strong>Here are some customer reviews we spotted on Amazon.</strong></p>
<p>I really like how this &#8220;toolbox&#8221; book is put together. It is quick and to the point in a compact format. It shows what minitab tests can be used within each phase of the DMAIC process. I recommend it for the person who already has at least a bit of an understanding of the Six Sigma philosophy. It is the kind of book you will use for minitab reference over and over again.</p>
<p>This book is my go-to resource for DMAIC processes. It provides simple explanations of the tools and outcomes and is the reference material I always recommend to new belts.</p>
<p>Really good book for anybody using Minitab in Six Sigma or just plain statistics. Not too superficial (like many &#8216;Management Books&#8217;) and not too many silly examples (like some &#8216;Text Books&#8217;). Really useful. After reviewing it we bought one copy for each Black Belt at our company.</p>
<p>I do agree, however, that the new listed price is $49.99 given that much of Minitab’s documentation is now complete and freely available on their website.</p>
<p><strong>Spiral-bound:</strong> 240 pages</p>
<p><strong>Publisher:</strong> QSB Consulting (October 23, 2006)</p>
<p><strong>Language:</strong> English</p>
<p><strong>ISBN-10:</strong> 0954681320</p>
<p><strong>ISBN-13:</strong> 978-0954681326</p>
<p>Also, if you’re looking for a great Master Black Belt, mentor, statistician, I highly recommend George Woodley.  George learned Six Sigma at Motorola, taught SPC at Ford but to name a few of his stints.  Check out his website at <a href="http://qualityforall.net/">http://qualityforall.net</a></p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/08/27/six-sigma-black-belts-what-should-they-know/">Six Sigma Black Belts What Should They Know?</a> <span class="related-post-date timestamp">Thu 27 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/15/top-10-six-sigma-black-belt-excuses/">Top 10 Six Sigma Black Belt Excuses</a> <span class="related-post-date timestamp">Thu 15 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/07/implementing-a-six-sigma-initiative-within-a-small-business/">Implementing a Six Sigma Initiative Within a Small Business</a> <span class="related-post-date timestamp">Mon 07 Sep 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/02/barloworld-boosts-efficiency-and-customer-satisfaction-with-six-sigma/">Barloworld Boosts Efficiency and Customer Satisfaction with Six Sigma</a> <span class="related-post-date timestamp">Wed 02 Sep 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/29/why-six-sigma-will-outlast-total-quality-management/">Why Six Sigma Will Outlast Total Quality Management</a> <span class="related-post-date timestamp">Sat 29 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/">Six Sigma and Baseball</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Introduction to Six Sigma Statistics</title>
		<link>http://sixsigmaz.com/2009/10/19/introduction-to-six-sigma-statistics/</link>
		<comments>http://sixsigmaz.com/2009/10/19/introduction-to-six-sigma-statistics/#comments</comments>
		<pubDate>Mon, 19 Oct 2009 17:04:26 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[DFSS]]></category>
		<category><![CDATA[DMAIC]]></category>
		<category><![CDATA[Lean]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[Training]]></category>
		<category><![CDATA[attribute data]]></category>
		<category><![CDATA[average deviation]]></category>
		<category><![CDATA[belt candidates]]></category>
		<category><![CDATA[continuous data]]></category>
		<category><![CDATA[design of experiment]]></category>
		<category><![CDATA[measures of central tendency]]></category>
		<category><![CDATA[measures of dispersion]]></category>
		<category><![CDATA[minitab]]></category>
		<category><![CDATA[multiple regression analysis]]></category>
		<category><![CDATA[squared deviation]]></category>
		<category><![CDATA[zero value]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=603</guid>
		<description><![CDATA[I thought I’d bring in some key points here before embarking on a serious Design of Experiment, or multiple regression analysis. I’ve come across several black belt candidates in training (like myself), who get too caught up with the statistical side of things (or Minitab) and forget the basics. I thought 1. Learn the scales [...]]]></description>
			<content:encoded><![CDATA[<p>I thought I’d bring in some key points here before embarking on a serious Design of Experiment, or multiple regression analysis.  I’ve come across several black belt candidates in training (like myself), who get too caught up with the statistical side of things (or Minitab) and forget the basics.  I thought<br />
<strong><br />
1. Learn the scales of measure</strong></p>
<p><span style="text-decoration: underline;">Nominal</span><br />
It is in the name<br />
Marital status, Phone numbers</p>
<p><span style="text-decoration: underline;">Ordinal</span><br />
Relative, unequal value ranking<br />
Race finish, opinion poll response</p>
<p><span style="text-decoration: underline;">Interval</span><br />
Equal intervals are equal differences<br />
Calendar year, Fahrenheit temperature</p>
<p><span style="text-decoration: underline;">Ratio</span><br />
Proportional amount of difference<br />
Has a real zero value<br />
Annual income, Kelvin temperature</p>
<p><strong>2.	Learn the measures of central tendency</strong></p>
<p><span style="text-decoration: underline;">Mean</span>:  Arithmetic average of a set of values<br />
•	Reflects the influence of all values<br />
•	Strongly Influenced by extreme values<br />
<span style="text-decoration: underline;">Mode</span>:  The most frequently occurring value<br />
<span style="text-decoration: underline;">Median</span>:  Midpoint in a string of sorted data, where 50% of the observations, or values, are below and 50% are above<br />
•	Does not necessarily include all values in calculation<br />
•	Is “robust” to extreme scores<br />
•	Organize the data from low values to high when determining the Median</p>
<p><strong>3.	Learn the measures of dispersion</strong></p>
<p>Range: the distance between the extreme values of a data set  (Highest &#8211; Lowest)<br />
The range is more sensitive to outliers than the variance</p>
<p>Variance: the Average Squared Deviation of each data point from the Mean</p>
<p>Standard Deviation: the Square Root of the Variance<br />
measure of the average deviation about the mean</p>
<p><strong>4. Understand the different types of data</strong></p>
<p>Variable (quantative) and Continuous Data (Decimal subdivisions are meaningful)<br />
o	Time (seconds)<br />
o	Pressure (psi)<br />
o	Conveyor Speed (ft/min)<br />
o	Rate (inches</p>
<p>Attribute (qualitative)<br />
o	Categories<br />
o	Good/Bad (Pass/Fail)<br />
o	Machine 1, Machine 2, Machine 3<br />
o	Shift number<br />
o	Counted things (# of Errors in a document, # units shipped, etc.)</p>
<p>Convert Attribute to Continuous wherever possible.  Here are some examples of attribute data converted to continuous data:</p>
<p>–	Count of defects to ‘% defects’<br />
–	Y/N late to ‘average days late’<br />
–	Leaks/No leaks to ‘rate of leaks on a continuous scale’<br />
–	Success or failure of electrical parts to ‘voltage flow of good parts’</p>
<p><strong>5.	Descriptive Statistics</strong></p>
<p>Consists of basic statistics and graphical techniques used to summarize data<br />
Measures of central location<br />
Measures of spread (dispersion)<br />
Evaluation of symmetry &amp; skewness<br />
Typical graphical techniques<br />
-	Histograms<br />
-	Boxplots<br />
-	Dotplots<br />
-	Normal probability plots</p>
<p><strong>6.	Be able to identify different data distributions</strong></p>
<p>Why does it matter?  If you’re treating non-normal data as normal, you’ll get a totally different p-value for your data set, thereby overstating or understating your prediction or analysis.</p>
<p><span style="text-decoration: underline;">Normal Distribution (Bell Curve)</span></p>
<p>The “Normal” Distribution is a distribution of data which has certain consistent properties (the mean, median and mode are equal in value)</p>
<p>These properties are very useful in our understanding of the characteristics of the underlying process from which the data were obtained</p>
<p>Most natural phenomena and man-made processes are distributed normally, or can be represented as normally distributed</p>
<p>The Normal Distribution is a continuous distribution which is symmetrical and extreme values are less likely than moderate values (unimodal)</p>
<p>An example would be measuring heights of people or the length of a table.  In either case the measurement is continuous and can be broken down into finer increments</p>
<p><span style="text-decoration: underline;">t-distribution</span></p>
<p>The t distribution assumes samples are drawn from a normal distribution but the population variance, s2, is not known… The shape of the t-distribution varies as the sample size, n, changes.  The distribution becomes more narrow as the sample size becomes larger. As n becomes very large, the critical value corresponding to the area under the curve approaches the Normal distribution’s Z value</p>
<p><span style="text-decoration: underline;">Poisson Distribution</span></p>
<p>Appropriate as a model of number of defects or nonconformities in a unit of product<br />
X is number of defects found in a per unit basis<br />
–	Per unit area, per unit volume, per unit time, etc.<br />
–	Area X is a discrete, positive integer<br />
–	Area for opportunity is a finite region of space, time or product</p>
<p>When the average is high, the distribution can be approximated by the normal distribution</p>
<p>When the average is low, the distribution is skewed to the right</p>
<p><span style="text-decoration: underline;">F Distribution</span></p>
<p>A continuous distribution formed from the ratio of variances calculated from two independent samples drawn from Normal Distributions</p>
<p><span style="text-decoration: underline;">Chi-square Distribution</span></p>
<p>A continuous distribution used in statistical hypothesis testing and confidence interval estimation for many different applications, including inferences about a population variance</p>
<p><strong>7.	Know about samples and population sizes</strong></p>
<p>Population is every possible observation (census)</p>
<p>Samples are subsets of populations</p>
<p>Data is obtained using samples because we seldom know the entire population</p>
<p>Descriptive statistics apply to any distribution<br />
-	Sample or population</p>
<p>Population statistics are desired, but often not available</p>
<p>Samples from a population can be used to ‘infer’ or approximate population parameters</p>
<p><strong>8.	Know the Statistics and Reporting Tools</strong></p>
<p>By far the most used Six Sigma tools is Microsoft Excel.  Excel provides most of the day to day uses required to manage most Six Sigma projects.  Please note though that Excel must be accompanied by other Microsoft tools such as Word, PowerPoint, and Outlook for email.  In other words, the Six Sigma Black or Green belt candidate should always ensure that he or she has the latest version of Microsoft Office installed.</p>
<p>Also, check with your IT department or on the Microsoft Excel disc for the “Data Analysis Pack” that will allow you, through Excel to perform:</p>
<p>–	Anova<br />
–	Correlations<br />
–	Covariances<br />
–	Descriptive Statistics<br />
–	Exponential Smoothing<br />
–	F-Tests and Two-Samples for Variances<br />
–	Fourier Analysis<br />
–	Histograms<br />
–	Various t-tests</p>
<p>However, the ultimate tool for Six Sigma Black Belts is MiniTab.  Minitab will get you going where Excel leaves you hanging.</p>
<p>–	Minitab will go into profound levels of englightenment with<br />
–	Measurement System Analysis<br />
–	Multivariate Analysis<br />
–	Anova<br />
–	Regression Analysis<br />
–	Statistical Process Control<br />
–	Reliability/Survival Analysis<br />
–	And other great simulations</p>
<p>Don’t forget to check out Minitab’s powerful reporting tools that will provide the following graphical reports:</p>
<p>–	Dotplots / Histograms / Normal Plots<br />
–	Run charts / Time Series<br />
–	Pareto Diagrams<br />
–	Stratification (2nd Level Pareto)<br />
–	Boxplots<br />
–	Scatter Plots<br />
–	Checksheets / Concentration Diagrams</p>
<p>Now you should be ready to go run your multi-var charts, multiple regression, 2k full-factorial regression, and last but not least, your Design of Experiment.  We’ll keep data transformations to another paper.</p>
<p>Please don’t hesitate to drop us a line at admin@sixsigmaz.com if you have any questions or concerns.</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/10/14/measurement-system-analysis-for-beginners/">Measurement System Analysis for Beginners</a> <span class="related-post-date timestamp">Wed 14 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/">Six Sigma and Minitab A complete Toolbox Guide for All Six Sigma Practitioners</a> <span class="related-post-date timestamp">Wed 11 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/04/design-of-experiments-doe-planning-and-some-basic-terms/">Design of Experiments DOE Planning and Some Basic Terms</a> <span class="related-post-date timestamp">Wed 04 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/15/top-10-six-sigma-black-belt-excuses/">Top 10 Six Sigma Black Belt Excuses</a> <span class="related-post-date timestamp">Thu 15 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/01/minitabs-latest-quality-companion-software-update-adds-value-stream-mapping/">Minitab's Latest Quality Companion Software Update Adds Value Stream Mapping</a> <span class="related-post-date timestamp">Thu 01 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/07/the-impact-of-six-sigma-training-certification/">The Impact of Six Sigma Training & Certification</a> <span class="related-post-date timestamp">Mon 07 Sep 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/10/19/introduction-to-six-sigma-statistics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Introduction to Design of Experiments</title>
		<link>http://sixsigmaz.com/2009/09/21/introduction-to-design-of-experiments/</link>
		<comments>http://sixsigmaz.com/2009/09/21/introduction-to-design-of-experiments/#comments</comments>
		<pubDate>Mon, 21 Sep 2009 16:54:23 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[Lean]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[Six Sigma News]]></category>
		<category><![CDATA[Design of Experiments]]></category>
		<category><![CDATA[doe]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=419</guid>
		<description><![CDATA[The Basic Concepts Underlying DOE Six Sigma stresses making alterations in existing business processes for improving overall efficiency. To make effective alterations, it is important to first understand the various aspects of business processes so that the cause and effect relationship between various processes can be determined. However, this is easier said than done because [...]]]></description>
			<content:encoded><![CDATA[<p>The Basic Concepts Underlying DOE</p>
<p>Six Sigma stresses making alterations in existing business processes for improving overall efficiency. To make effective alterations, it is important to first understand the various aspects of business processes so that the cause and effect relationship between various processes can be determined. However, this is easier said than done because most business processes comprise of multiple sub processes, which themselves are quite complicated.</p>
<p>Making alterations in a simple business process having just two to three sub-processes is easy, but the task becomes quite difficult when alterations are to be made in a business process having more than ten sub-processes. The level of difficulty goes on increasing with an increase in the number of sub-processes. This is where DOE is beneficial as it can easily crawl through vast amounts of data that is generated as part of the six-sigma implementation process. This data is meant for understanding the cause and effect relationship between various processes and sub-processes, but since the amount of data generated is quite huge, advanced tools such as DOE are used.</p>
<p>Why Is DOE Necessary?</p>
<p>DOE analyses the data and generates quantifiable results, which are then used for defining the type of experiments or alterations that are to be conducted for achieving Six Sigma quality levels. Conducting pre-tests or experiments is necessary in Six Sigma because the organizational efficiency and productivity are at stake while the implementations are going on. If Six Sigma concepts and methodologies are not implemented properly, it can seriously affect the company’s bottom-line and lead to redundancies.</p>
<p>Another thing is that organizations cannot hope to achieve desired results just by conducting wayward and misguided experiments or alterations. DOE is effective because it helps Six Sigma professionals in selecting the most suitable experiment designs that in turn would help in achieving the desired results. In the absence of DOE, it would be quite difficult to ascertain the type of experiments that are to be conducted.</p>
<p>Other Uses And Benefits</p>
<p>DOE can also be used for understanding the root cause of variations in a business process. Business processes are designed for delivering exactly the same quality and quantity, irrespective of what they are being used for, be it for manufacturing a product or rendering a service. However, maintaining this level of consistency is not always possible because the efficiency of business processes can be affected by various variable factors that are quite difficult to decipher.</p>
<p>This is where DOE can help because it can analyze vast amounts of data related to the variations and generate results that can be used for pointing out the root cause of variations. Once the variations have been eliminated, it would become quite easy for Six Sigma professionals to successfully carry out the implementations. DOE is thus an inseparable part of Six Sigma implementations in any type of industry.</p>
<p></P><br />
Tony Jacowski is a quality analyst for The MBA Journal.  Aveta Solution’s Six Sigma Online offers online six sigma training and certification classes for lean six sigma, black belts, green belts, and yellow belts.</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/">Six Sigma and Baseball</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/">Six Sigma and Minitab A complete Toolbox Guide for All Six Sigma Practitioners</a> <span class="related-post-date timestamp">Wed 11 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/24/normality-assumption-explained/">Normality Assumption Explained</a> <span class="related-post-date timestamp">Mon 24 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/04/design-of-experiments-doe-planning-and-some-basic-terms/">Design of Experiments DOE Planning and Some Basic Terms</a> <span class="related-post-date timestamp">Wed 04 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/11/what-is-six-sigma/">What is Six Sigma?</a> <span class="related-post-date timestamp">Sun 11 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/02/how-to-explain-the-six-sigma-approach-in-a-minute-or-less/">How to Explain the Six Sigma Approach in a Minute or Less</a> <span class="related-post-date timestamp">Fri 02 Oct 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/09/21/introduction-to-design-of-experiments/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Six Sigma and Baseball</title>
		<link>http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/</link>
		<comments>http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/#comments</comments>
		<pubDate>Thu, 27 Aug 2009 02:01:13 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[DMAIC]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[VOC]]></category>
		<category><![CDATA[Cpk]]></category>
		<category><![CDATA[doe]]></category>
		<category><![CDATA[house of quality]]></category>
		<category><![CDATA[iso9001]]></category>
		<category><![CDATA[Kano]]></category>
		<category><![CDATA[Malcolm Baldrige]]></category>
		<category><![CDATA[qfd]]></category>
		<category><![CDATA[tqm]]></category>
		<category><![CDATA[y=f(x)]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=178</guid>
		<description><![CDATA[With a much anticipated and needed vacation coming to an end I thought I would deliver on my personal promise and write a short article in order to better organize my 6s thoughts, which have been nothing less than tumultuous lately. I&#8217;ve been troubled for a few months now regarding the future of Six Sigma [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_181" class="wp-caption alignright" style="width: 210px"><a href="http://sixsigmaz.com/wp-content/uploads/2009/08/sixsigmabaseball.jpg" rel="lightbox[178]"><img class="size-full wp-image-181" title="Six Sigma Baseball" src="http://sixsigmaz.com/wp-content/uploads/2009/08/sixsigmabaseball.jpg" alt="sixsigmabaseball" width="200" height="265" /></a><p class="wp-caption-text">Six Sigma Baseball</p></div>
<p>With a much anticipated and needed vacation coming to an end I thought I would deliver on my personal promise and write a short article in order to better organize my 6s thoughts, which have been nothing less than tumultuous lately.</p>
<p>I&#8217;ve been troubled for a few months now regarding the future of Six Sigma and what direction my career should (could) take.  If Motorola introduced what seemed to be the next best thing since sliced bread (baked, packaged, and delivered a la ISO9001 of course) and TQM, why hasn&#8217;t widespread adoption outside the business world occurred?</p>
<p>If Six Sigma is good enough for Motorola, GE, and Tyco and thousands of others, why isn&#8217;t it good enough for our governments, schools, and hospitals?</p>
<p>Malcolm Baldrige compensates for the lack of 6s in the healthcare industry but where are they with statistical analysis and SPC?</p>
<p>During my holidays I read a good book which seems to have piqued my boss&#8217;s interest for the shear Billy Beane Factor:  Moneyball:  The Art of Winning an Unfair Game by Michael Lewis, ISBN-13: 978-0393324815, published by W.W. Norton &amp; Co.</p>
<p>Moneyball goes to great lengths to explain the Oakland A&#8217;s success at seasonal ball game wins on the lowest payroll in Big League ball.</p>
<p>As it turns out, the problem in professional baseball was that everyone was looking at the wrong stats.  Let me take that back.  More relevant stats needed to be made available and scouts had to stop recruiting on emotion and gut feeling and identify true talent.  Up until then it didn&#8217;t seem that any statistical analysis had been extensively correlated.  This screams measurement system analysis (MSA) to me.</p>
<p>Bill James realized this in the late seventies and his annual <em>Baseball Abstract</em> provided the Oakland A&#8217;s the necessary data and approach to the first non-official Six Sigma Baseball project.</p>
<p>Some of you are probably wondering where I&#8217;m going with this, or have read Moneyball (no mention of Six Sigma or SPC for that matter) but hold on:</p>
<p>Isn&#8217;t Six Sigma just another means to deliver the best possible perhaps even perfect? product, process, or service as possible at the lowest cost possible as quickly as possible? It&#8217;s as simple as y=f(x).  Don&#8217;t bore me with Voice of the Customer (VOC), Quality Function Deployments (QFD) and Houses of Quality, Kanos, Designs of Experiments (DOE), Analysis of Variances (ANOVA), etc.  I don’t want to oversimplify and denigrate the Six Sigma approach, however, let’s not complicate things either.</p>
<p>Here’s my DMAIC take on baseball.</p>
<p><strong>Define</strong></p>
<p>What are the Oakland A’s trying to accomplish?  World Series?  More ticket sales?  Increased revenue from merchandising?  Happier fans translates into increased ticket sales.</p>
<p>The catch here is that if fans want to see slugging and home runs, this doesn’t necessarily translate into wins.  Read Moneyball to find out how fans react when established sluggers were told to walk bases.</p>
<p><strong>Measure:</strong></p>
<p>What are we measuring?  Errors, On-Base Percentages? Runs? Strike-outs?</p>
<p><strong>Analyze</strong></p>
<p>What are the correlations that we need to identify to better analyze the game and increase wins.  What works now may not work in the playoffs.</p>
<p>Understand that a batting average of .288 in the season, may actually be .201 when a certain batter faces a certain pitcher.</p>
<p>Do change-up pitches always work with the same batters?  Does the speed of a fastball correlate to the number of strikeouts?</p>
<p><strong>Improve</strong></p>
<p>How do we make better trades and optimize the drafting process?  Will a first-round draft pick be necessarily better than a sixteenth-round pick?</p>
<p><strong>Control</strong></p>
<p>Remember the Upper and Lower Control Limits Specifications for whatever it is you’re monitoring.  The LCL and UCL can apply to the strike zone, the acceptable number of men on base with two outs, the batter’s count, etc.</p>
<p>Consider a pinch-hitter or a relief pitcher.</p>
<p><strong>Conclusion</strong></p>
<p>While much work has been done over the past twenty years with regards to the Six Sigma in improving manufacturing, product design, and service industries, with increasingly more resources being made available with software engineering, healthcare, and financial sectors, where are we with 6s in our personal lives?  Wouldn’t it be great if we had our own personal toolboxes our neighbors used</p>
<p>Would anyone be so kind as to demonstrate the Cpk or sigma level of a baseball team, recruiting process, or any other interestingly fun spin on Six Sigma and baseball?  Many Six Sigma practitioners are responsible for increasing efficiencies and reducing costs by hundreds of percentage points.  Has anyone begun simple projects to monitor their fuel economy on their way to work, or find more efficient ways of saving time on their daily subway trip to work?</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/09/01/introduction-to-msa-measurement-systems-analysis/">Introduction to MSA Measurement Systems Analysis</a> <span class="related-post-date timestamp">Tue 01 Sep 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/29/six-sigma-strategies-creating-excellence-in-the-workplace/">Six Sigma Strategies - Creating Excellence in the Workplace</a> <span class="related-post-date timestamp">Sat 29 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/29/why-six-sigma-will-outlast-total-quality-management/">Why Six Sigma Will Outlast Total Quality Management</a> <span class="related-post-date timestamp">Sat 29 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/23/six-sigma-servqual-windows-on-the-bay-restaurant/">Six Sigma Servqual | Windows On The Bay Restaurant</a> <span class="related-post-date timestamp">Sun 23 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/">Six Sigma and Minitab A complete Toolbox Guide for All Six Sigma Practitioners</a> <span class="related-post-date timestamp">Wed 11 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/21/introduction-to-design-of-experiments/">Introduction to Design of Experiments</a> <span class="related-post-date timestamp">Mon 21 Sep 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Six Sigma Glossary &amp; Definitions</title>
		<link>http://sixsigmaz.com/2009/08/26/six-sigma-glossary-definitions/</link>
		<comments>http://sixsigmaz.com/2009/08/26/six-sigma-glossary-definitions/#comments</comments>
		<pubDate>Wed, 26 Aug 2009 11:06:56 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[VOC]]></category>
		<category><![CDATA[CTQ]]></category>
		<category><![CDATA[Definitions]]></category>
		<category><![CDATA[DFSS]]></category>
		<category><![CDATA[Glossary]]></category>
		<category><![CDATA[t-test]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=164</guid>
		<description><![CDATA[Alpha Risk &#8211; Risk of concluding that two characteristics are different when they are actually the same. Alternative Hypothesis &#8211; Statement of change or difference. The statement is assumed correct if the null hypothesis is not supported by the data. ANOVA &#8211; Analysis of Variance &#8211; Hypothesis test used to determine whether two or more [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Alpha Risk &#8211; </strong>Risk of concluding that two characteristics are different when they are actually the same.</p>
<p><strong> </strong></p>
<p><strong>Alternative Hypothesis &#8211; </strong>Statement of change or difference. The statement is assumed correct if the null hypothesis is not supported by the data.</p>
<p><strong> </strong></p>
<p><strong>ANOVA &#8211; Analysis of Variance &#8211; </strong>Hypothesis test used to determine whether two or more sample means are significantly different.</p>
<p><strong> </strong></p>
<p><strong>Attribute Data &#8211; </strong>Sometimes used in place of the term discrete data. Attribute data is qualitative data that has two outcomes. Examples include pass/fail, tastes good/tastes bad and acceptable/not acceptable. Attribute data is used specifically in reference to Attribute Control Charts, which are limited to counting and plotting defects or defectives.</p>
<p><strong> </strong></p>
<p><strong>Attribute Gage R&amp;R &#8211; </strong>A Gage Repeatability &amp; Reproducibility (Gage R&amp;R) study used to determine the measurement variability when attribute data is used.</p>
<p><strong> </strong></p>
<p><strong>Backbone Test &#8211; </strong>A main supporting tool or factor used in statistical analysis.</p>
<p><strong> </strong></p>
<p><strong>Bartlett</strong><strong>&#8216;s Test &#8211; </strong>Type of hypothesis test that compares variances of two or more normally distributed populations.</p>
<p><strong> </strong></p>
<p><strong>Baseline &#8211; </strong>The &#8220;as-is&#8221; level of performance of the process being investigated.</p>
<p><strong> </strong></p>
<p><strong>Baseline Control Chart &#8211; </strong>A graph of the initial process performance over time that is used to detect whether common or special cause variation exists in a process.</p>
<p><strong> </strong></p>
<p><strong>Beta Risk &#8211; </strong>Risk of concluding that two characteristics are the same when they are actually different. <strong> </strong></p>
<p><strong> </strong></p>
<p><strong>Between Variance &#8211; </strong>Between variance is based on the distances of the sample means from the overall mean. If the samples come from populations with equal means, one would not expect an undue amount of variation in the sample means.</p>
<p><strong> </strong></p>
<p><strong>Bias &#8211; </strong>Bias within a measurement system describes the difference between the observed average and a reference value of the measurements. Bias is related to accuracy.</p>
<p><strong> </strong></p>
<p><strong>Binomial Distribution &#8211; </strong>A distribution of probabilities used when there are only two independent outcomes to the hypothesis test (e.g. heads/tails, yes/no).</p>
<p><strong> </strong></p>
<p><strong>Box Plot &#8211; </strong>A type of plot which identifies variability and centering based on quartiles. It uses a rectangular box to represent the middle 50% of the data and whiskers to show the extent of the data. A Box Plot requires data from a continuous output and a discrete (typically attribute) input at one or more levels. <strong> </strong></p>
<p><strong> </strong></p>
<p><strong>C Chart &#8211; </strong>A graphical display of number (counts) of defects in a subgroup (part or unit) with a constant sample size. A C Chart can be used to detect special cause variation.</p>
<p><strong> </strong></p>
<p><strong>Capability Analysis &#8211; </strong>A statistical measure which helps determine how well a process meets customers&#8217; standards.</p>
<p><strong> </strong></p>
<p><strong>Capability Indices &#8211; </strong>Indices describing the overall effectiveness of a process in meeting specific criteria in both the short and long term.</p>
<p><strong> </strong></p>
<p><strong>C&amp;E Analysis &#8211; Cause and Effect Analysis &#8211; </strong>The Cause and Effect Analysis (C&amp;E Analysis) is an analysis in which either causes to an effect are brainstormed (the Cause and Effect Diagram), or multiple causes, which have already been identified, are prioritized against several criteria (the Cause and Effect Matrix). C&amp;E Analyses are typically used when limited data is available</p>
<p><strong> </strong></p>
<p><strong>C&amp;E Diagram &#8211; Cause and Effect Diagram &#8211; </strong>The Cause and Effect Diagram (C&amp;E Diagram) is a fishbone-shaped diagram that helps a team brainstorm potential root causes of a defect. The problem to be solved is placed at the head of the fish and the six thought-generating categories &#8211; Personnel, Machines, Materials, Methods, Measurements and Environment &#8211; are placed at the ends of the fish bones.</p>
<p><strong> </strong></p>
<p><strong>C&amp;E Matrix &#8211; Cause and Effect Matrix &#8211; </strong>The Cause and Effect Matrix (C&amp;E Matrix) is a tool that allows one to prioritize many items when there are many criteria they need to be prioritized against.</p>
<p><strong> </strong></p>
<p><strong>Chi-Square Distribution &#8211; </strong>A sampling distribution used to determine the confidence interval for standard deviation and to perform Chi-Square Hypothesis Tests: Goodness of Fit Test and Test for Association.</p>
<p><strong> </strong></p>
<p><strong>Chi-Square Hypothesis Test &#8211; </strong>Hypothesis test used to test the goodness-of-fit between a sample and a hypothesized distribution, or the association of two or more variables.</p>
<p><strong> </strong></p>
<p><strong>Classical Yield &#8211; </strong>The ratio of the number of units that ultimately pass through the entire process to the number of units that enter into the process. See also Final Yield, which equals output divided by input.</p>
<p><strong> </strong></p>
<p><strong>Clustering &#8211; </strong>A type of special cause variation within a Run Chart in which the data set exhibits fewer runs around the median than expected, resulting in a &#8220;clustering&#8221; of data points in just a small range of values or a shift in the average values.</p>
<p><strong> </strong></p>
<p><strong>Coefficient of Determination &#8211; </strong>A measure of the correlation between the dependent and independent variables in a Regression Analysis.</p>
<p><strong> </strong></p>
<p><strong>Common Cause Variation &#8211; </strong>Process variability that is free of assignable cause. It is typically associated with short-term variability or subgroup variability. Also referred to as white noise or expected variation.</p>
<p><strong> </strong></p>
<p><strong>Confidence Interval &#8211; </strong>A range of numbers in which population parameters are likely to fall.</p>
<p><strong> </strong></p>
<p><strong>Confidence Level &#8211; </strong>The fixed probability of correctly accepting the null hypothesis.</p>
<p><strong> </strong></p>
<p><strong>Contingency Table &#8211; </strong>A table of observed frequencies used in Chi-Square Hypothesis Tests.</p>
<p><strong> </strong></p>
<p><strong>Continuous Data &#8211; </strong>Continuous data, sometimes referred to as variable data, is data that is measured on a continuum or a scale that can be meaningfully divided into finer and finer increments of precision. For example, length and weight can be measured to any desired level of precision.</p>
<p><strong> </strong></p>
<p><strong>Continuous Gage R&amp;R &#8211; </strong>A Gage Repeatability &amp; Reproducibility (Gage R&amp;R) study used to determine the measurement variability when continuous data is used.</p>
<p><strong> </strong></p>
<p><strong>Control Chart &#8211; </strong>A process control tool, in which data is plotted and statistically analyzed in order to discern whether the process exhibits common and/or special cause variation.</p>
<p><strong> </strong></p>
<p><strong>Control Limit &#8211; </strong>Control Limits are typically set +/-3 standard deviations from the centerline to determine whether or not the process is in control.</p>
<p><strong> </strong></p>
<p><strong>Control Plan &#8211; </strong>A single document or set of documents that provides a point of reference among the Key Process Input Variables (KPIV&#8217;s), Key Process Output Variables (KPOV&#8217;s), specifications and instructions for the completed project. It documents the actions, including schedules and responsibilities that are needed to control the KPIV&#8217;s at their optimal settings.</p>
<p><strong> </strong></p>
<p><strong>Controllable Input &#8211; </strong>A process input that has adjustable settings, which can be modified by the project team during a project.</p>
<p><strong> </strong></p>
<p><strong>Correlated &#8211; </strong>Data identified as having a causal, complementary, parallel or reciprocal relationship.</p>
<p><strong> </strong></p>
<p><strong>Correlation &#8211; </strong>Statistical analysis that determines whether or not one variable can be used to predict another. Correlation does not necessarily prove causation.</p>
<p><strong> </strong></p>
<p><strong>Correlation Coefficient &#8211; </strong>A measure of the interdependence of two random variables that range in value from -1 to +1, indicating perfect negative correlation at -1, absence of correlation at zero and perfect positive correlation at +1.</p>
<p><strong> </strong></p>
<p><strong>Cp &#8211; </strong>A capability index that measures the potential capability of a process to meet expected specification limits, or tolerance levels, in the short term, assuming the process is ideally centered (regardless of where the process is actually centered).</p>
<p><strong> </strong></p>
<p><strong>Cpk &#8211; </strong>A capability index that measures the potential capability of a current process to meet expected specification limits, or tolerance levels, in the short term, using the current process average.</p>
<p><strong> </strong></p>
<p><strong>Critical Input Variable &#8211; </strong>An input variable (X) that has been statistically proven to impact the process output (Y).</p>
<p><strong> </strong></p>
<p><strong>CTC &#8211; Critical to Cost &#8211; </strong>Reference to a product, service and/or transactional characteristic that significantly influences a customer in terms of cost.</p>
<p><strong> </strong></p>
<p><strong>CTD &#8211; Critical to Delivery &#8211; </strong>Reference to a product, service and/or transactional characteristic that significantly influences a customer in terms of delivery.</p>
<p><strong> </strong></p>
<p><strong>CTQ &#8211; Critical to Quality &#8211; </strong>Reference to a product, service and/or transactional characteristic that significantly influences a customer in terms of quality.</p>
<p><strong> </strong></p>
<p><strong>CTS &#8211; Critical to Satisfaction &#8211; </strong>Expression of customers&#8217; vital needs and can include any of the CTC, CTQ and CTD requirements.</p>
<p><strong> </strong></p>
<p><strong>Curvilinear Relationship &#8211; </strong>A quadratic relationship evident between two variables, that approximates a curved line on a Scatter Plot. It indicates that one variable depends on the squared value of the other.</p>
<p><strong> </strong></p>
<p><strong>Customer Specification Limits &#8211; </strong>Customer-defined limits for acceptable outputs. When a measured value falls outside these limits, a defect has been created.</p>
<p><strong> </strong></p>
<p><strong>DPM &#8211; Defects per Million &#8211; </strong>Measure of total defects per million units.</p>
<p><strong> </strong></p>
<p><strong>DPMO &#8211; Defects per Million Opportunities &#8211; </strong>Equals the total number of defects per unit divided by the total number of opportunities for defects per unit multiplied by 1,000,000.</p>
<p><strong> </strong></p>
<p><strong>DPO -</strong> <strong>Defects per Opportunity &#8211; </strong>Measure of the total defects per unit divided by the opportunities per unit.</p>
<p><strong> </strong></p>
<p><strong>DPU &#8211; Defects per Unit &#8211; </strong>Calculated by dividing the total number of defects by the total number of units. Also used to calculate Rolled Throughput Yield.</p>
<p><strong> </strong></p>
<p><strong>Degrees of Freedom &#8211; </strong>Degrees of freedom (df) of a test statistic equals the number of independent observations in a sample minus the number of population parameters that are estimated from the sample observation.</p>
<p><strong> </strong></p>
<p><strong>DFSS &#8211; Design for Six Sigma – </strong>Systematic methodology for incorporating VOC at product design time ensuring a Six Sigma level</p>
<p><strong> </strong></p>
<p><strong>DOE &#8211; Design of Experiments &#8211; </strong>Planned experiments that allow for the simultaneous statistical analysis of several PIVs to determine their effects on any measurable POV. Used to prove correlation and causation.</p>
<p><strong> </strong></p>
<p><strong>Deviation &#8211; </strong>The distance between a data point and the mean. Deviation measures and describes the variation in a set of data.</p>
<p><strong> </strong></p>
<p><strong>Discrete Data &#8211; </strong>Numeric data that is not capable of being meaningfully subdivided into more precise increments.</p>
<p><strong> </strong></p>
<p><strong>Discrimination &#8211; </strong>The ability of the measurement system to adequately detect the smallest tolerable changes within the process.</p>
<p><strong> </strong></p>
<p><strong>Distribution &#8211; </strong>A pattern or tendency depicted by randomly collected observations from a population.</p>
<p><strong> </strong></p>
<p><strong>DMAIC – Design, Measure, Analyze, Improve, Control &#8211; </strong>Six Sigma methodology applied to manufacturing or production processes.</p>
<p><strong> </strong></p>
<p><strong>Empirical Rule &#8211; </strong>A rule derived from observations that the probabilities of empirical (real world) distributions will approximate to 68%, 95% and 99.7% of the values within one, two and three standard deviations of the mean, even though the real-world distributions may not be perfectly normal.</p>
<p><strong> </strong></p>
<p><strong>FMEA &#8211; Failure Mode and Effects Analysis &#8211; </strong>Failure Mode and Effects Analysis is an analytical technique that allows a project team to ensure that, to the extent possible, potential failure modes and their associated causes/mechanisms have been evaluated and addressed to mitigate any risk to the customer.</p>
<p><strong> </strong></p>
<p><strong>Final Yield &#8211; </strong>Final Yield (YF) is a measure of the percentage of units that passed the final process test relative to the number of units that entered the process.</p>
<p><strong> </strong></p>
<p><strong>F-Ratio &#8211; </strong>The result of an F-Test. It indicates the measure of between-to-within variation.</p>
<p><strong> </strong></p>
<p><strong>Frequency &#8211; </strong>The ratio of the number of times an event occurs in a series of trials in a random experimental trial to the total number of trials in that experiment.</p>
<p><strong> </strong></p>
<p><strong>Frequency Distribution &#8211; </strong>A display indicating how often a particular observation or data value occurs and representing the distribution of data.</p>
<p><strong> </strong></p>
<p><strong>F-Test &#8211; </strong>Type of hypothesis test used to determine whether or not the within variance and between variance are the same.</p>
<p><strong> </strong></p>
<p><strong>Gage Control Plan &#8211; </strong>Documentation describing the strategy employed to ensure the reliability and adequacy of the measurement system in measuring input variables and monitoring output variables over the long term.</p>
<p><strong> </strong></p>
<p><strong>Hypothesis Test &#8211; </strong>A test in which the project team assumes an initial claim, the null hypothesis, to be true and then tests this claim against an alternative hypothesis using sample data.</p>
<p><strong> </strong></p>
<p><strong>I-MR Chart &#8211; </strong>A type of Control Chart for variable data that plots individual data and the moving range of the present and previous individuals.</p>
<p><strong> </strong></p>
<p><strong>Interaction Plot &#8211; </strong>A type of plot that graphs the averages of the output variable for each level of a factor, with the level of a second factor held constant for all combinations of levels. Interaction Plots readily show the presence of interactions; parallel lines in an Interaction Plot indicate no interaction, while greater departure from the parallel state indicates a higher degree of interaction.</p>
<p><strong> </strong></p>
<p><strong>KPIV &#8211; Key Process Input Variable &#8211; </strong>An input (X) that has been determined to have a statistically significant and causal relationship to the KPOV.</p>
<p><strong> </strong></p>
<p><strong>KPOV &#8211; Key Process Output Variable – </strong>Any outputs from a process that satisfies CTS  requirements.</p>
<p><strong> </strong></p>
<p><strong>Linearity &#8211; </strong>The ability of the measurement system to measure over its operating range with minimal bias.</p>
<p><strong> </strong></p>
<p><strong>LCL &#8211; Lower Control Limit &#8211; </strong>Limit in the Control Chart that is set below the centerline, typically at -3 standard deviations, in order to determine whether or not the process is in control.</p>
<p><strong> </strong></p>
<p><strong>LSL &#8211; Lower Specification Limit &#8211; </strong>Limit of a tolerance range specified by a CTS. A measured value that is below this limit is considered a defect.</p>
<p><strong> </strong></p>
<p><strong>Mean &#8211; </strong>A measure of the central tendency of a data set. It is the sum of a set of values divided by the number of summed values.  Same as average.</p>
<p><strong> </strong></p>
<p><strong>MSA &#8211; Measurement Systems Analysis &#8211; </strong>A series of designed tests used to assess measurement system capability.</p>
<p><strong> </strong></p>
<p><strong>Measurement Variability &#8211; </strong>The net effect of all the sources of measurement error that cause an observed value to deviate from the true value of the characteristic that is being measured.</p>
<p><strong> </strong></p>
<p><strong>Median – </strong>Center or middle number in a group of rank-ordered numbers (1, 4, <strong>5</strong>, 9, 14)</p>
<p><strong> </strong></p>
<p><strong>Mode &#8211; </strong>The most frequently observed value in a dataset.</p>
<p><strong> </strong></p>
<p><strong>Multiple Linear Regression &#8211; </strong>A linear regression with two or more predictors. The estimation of the output variable from two or more continuous input variables using a linear relationship (straight line or plane) between the output and input variables.</p>
<p><strong> </strong></p>
<p><strong>Noise &#8211; </strong>The Process Input Variables (PIV&#8217;s) with settings that cannot be adjusted or controlled by the project team.</p>
<p><strong> </strong></p>
<p><strong>Normal Distribution &#8211; </strong>A distribution of data described by the mean and standard deviation. The curve displaying the distribution of data is shaped like a bell, with the area under the curve representing 100% of all possible observations.</p>
<p><strong> </strong></p>
<p><strong>Normal Probability Plot &#8211; </strong>A type of plot that represents normally distributed sample data.</p>
<p><strong> </strong></p>
<p><strong>Normalized Yield &#8211; </strong>Normalized Yield (YNORM) is the equalized (same) yield assigned for each process step; it is the geometric average of the Rolled Throughput Yield (RTY) for the entire process.</p>
<p><strong> </strong></p>
<p><strong>NP Chart &#8211; </strong>This tool tracks the number of defectives (products, parts or units that do not conform to specified standards) in each subgroup and detects the presence of special cause variation. It can only be used when the sample size is constant.</p>
<p><strong> </strong></p>
<p><strong>Null Hypothesis &#8211; </strong>A statement about population parameters, typically implying &#8220;no effect&#8221; or &#8220;no difference.&#8221; This statement is assumed true until sufficient evidence is presented otherwise.</p>
<p><strong> </strong></p>
<p><strong>OFAT &#8211; One-Factor-at-a-Time &#8211; </strong>One-Factor-at-a-Time (OFAT) is an experimental design setup, in which each factor is varied one at a time while the remaining factors are held constant. OFAT is done in order to estimate the effect of a single variable on selected fixed conditions of other variables.</p>
<p><strong> </strong></p>
<p><strong>Outlier &#8211; </strong>Data point that is markedly inconsistent with the rest of the data set.</p>
<p><strong> </strong></p>
<p><strong>Output &#8211; </strong>The result of a process or product and its measurable characteristics. Also called Process Output Variable (POV).</p>
<p><strong> </strong></p>
<p><strong>P Chart &#8211; </strong>A P Chart tracks the proportion of defectives (products, parts or units that do not conform to specified standards) in each subgroup and detects the presence of special cause variation. It can be used for sample sizes that are either constant or variable.</p>
<p><strong> </strong></p>
<p><strong>Paired Sample &#8211; </strong>A sample in which each sampling unit contains a pair of observations that are not independent of one another (e.g. sampling ages of husband and wife as a single sampling unit).</p>
<p><strong> </strong></p>
<p><strong>Paired t Test &#8211; </strong>A type of hypothesis test used when analyzing the difference between the means obtained from paired samples.</p>
<p><strong> </strong></p>
<p><strong>Pareto Chart &#8211; </strong>Type of chart that compares the frequency and/or influence of various types of problems or causes of a problem. The horizontal axis represents the different categories, and the height of the bar represents the frequency of that category. Pareto Charts prioritize possible areas for improvement and require either discrete or continuous data.</p>
<p><strong> </strong></p>
<p><strong>Pareto Principle &#8211; </strong>Also referred to as Pareto Effect, the Pareto Principle states that 80% of the issues are the result of 20% percent of the causes.</p>
<p><strong> </strong></p>
<p><strong>Pearson Correlation &#8211; </strong>The most commonly used method of computing a correlation coefficient between variables that are linearly related.</p>
<p><strong> </strong></p>
<p><strong>Population &#8211; </strong>The entire collection or set of objects or individuals from which a sample is drawn for analysis. Population can also mean a set of values or characteristics of the members of the population, e.g. heights of people in the world.</p>
<p><strong> </strong></p>
<p><strong>Pp &#8211; </strong>A capability index that measures the potential capability of a process to meet expected specification limits, or tolerance levels, in the long term, assuming the process is ideally centered (regardless of where the process is actually centered).</p>
<p><strong> </strong></p>
<p><strong>Ppk &#8211; </strong>A capability index that measures the potential capability of a current process to meet expected specification limits, or tolerance levels, in the long term, using the current process average.</p>
<p><strong> </strong></p>
<p><strong>PIV &#8211; Process Input Variable &#8211; </strong>A Process Input Variable (PIV) is a characteristic of materials, equipment, information or any other resource that is needed to carry out a process. In other words, PIV&#8217;s are the X&#8217;s and potential X&#8217;s in the Y = f(x) equation.</p>
<p><strong> </strong></p>
<p><strong>POV &#8211; Process Output Variable &#8211; </strong>A Process Output Variable (POV) is a characteristic of a product or service that is created by the process and is passed onto the next process step or the customer. In other words, the POV is the Y in the Y = f(x) equation.</p>
<p><strong> </strong></p>
<p><strong>Process Specification Limits &#8211; </strong>Limits that reflect the customer specifications as allocated to the inputs and outputs of a process. Process Specification Limits apply either to Process Input Variables (PIV&#8217;s) or to Process Output Variables (POV&#8217;s). When a PIV falls outside these limits, the process may not function as designed. When a POV falls outside these limits, the process produces a defect.</p>
<p><strong> </strong></p>
<p><strong>Proportion &#8211; </strong>The percentage of a data set that has a specific characteristic of interest.</p>
<p><strong> </strong></p>
<p><strong>Proportion of Sample &#8211; </strong>The fraction of samples which exhibit a characteristic of interest.</p>
<p><strong> </strong></p>
<p><strong>p-Value &#8211; </strong>The probability of rejecting the null hypothesis when it is true.</p>
<p><strong> </strong></p>
<p><strong>QFD &#8211; Quality Function Deployment &#8211; </strong>Quality Functional Deployment (QFD) is a systematic methodology to integrate the Voice of the Customer (VOC) into the design and delivery of goods and services.</p>
<p><strong> </strong></p>
<p><strong>Quartile &#8211; </strong>The four equal parts into which a rank-ordered data set can be divided.</p>
<p><strong> </strong></p>
<p><strong>Range &#8211; </strong>A measure of variation within a data set. It is calculated by subtracting the minimum value from the maximum value.</p>
<p><strong> </strong></p>
<p><strong>Regression Analysis &#8211; </strong>A statistical technique used to investigate relationships between the output variable and one or more input variables.</p>
<p><strong> </strong></p>
<p><strong>Repeatability &#8211; </strong>The extent to which repeated measurements, made on the same item under absolutely identical conditions, produce the same result.</p>
<p><strong> </strong></p>
<p><strong>Reproducibility &#8211; </strong>The extent to which repeated measurements, made on the same item under different conditions or by different people, produce the same result.</p>
<p><strong> </strong></p>
<p><strong>Resolution &#8211; </strong>Otherwise known as discrimination, the ability of the measurement system to adequately detect the smallest tolerable changes within the process.</p>
<p><strong> </strong></p>
<p><strong>Response Variable &#8211; </strong>An output variable that depends on input factors in a designed experiment. The response is the function of input variables.</p>
<p><strong> </strong></p>
<p><strong>Rolled Throughput Yield &#8211; </strong>Rolled Throughput Yield (YRT) is a yield metric that measures the probability that a unit of product will make it through a series of opportunities defect-free. It is calculated by multiplying the Throughput Yields at each opportunity for a defect.</p>
<p><strong> </strong></p>
<p><strong>Root Cause &#8211; </strong>A specific cause, usually a Process Input Variable (PIV) that has demonstrated a direct and significant influence on the Process Output Variable (POV).</p>
<p><strong> </strong></p>
<p><strong>Sample &#8211; </strong>A portion of the entire collection of data.</p>
<p><strong> </strong></p>
<p><strong>Sample Data &#8211; </strong>Observations made on items selected from a larger population.</p>
<p><strong> </strong></p>
<p><strong>Sample Size &#8211; </strong>The number of observations made or number of items selected from a larger population.</p>
<p><strong> </strong></p>
<p><strong>Sampling &#8211; </strong>Sampling is the process of selecting samples to estimate a characteristic of the population.</p>
<p><strong> </strong></p>
<p><strong>Scatter Plot &#8211; </strong>A type of plot used to study the relationship between two variables. It requires either continuous or discrete data. Each data point is plotted as a dot with a specific X and Y coordinate value.</p>
<p><strong> </strong></p>
<p><strong>Significance Level &#8211; </strong>The significance level (also called level of significance) of a hypothesis test is the maximum allowable probability of incorrectly rejecting the null hypothesis.</p>
<p><strong> </strong></p>
<p><strong>Simple Linear Regression &#8211; </strong>A technique in which a straight line is fitted to a set of data points to measure the effect of a single independent variable. The slope of the line is the measured impact of that variable.</p>
<p><strong> </strong></p>
<p><strong>SIPOC &#8211; </strong><strong>Suppliers, Inputs, Process, Outputs and Customer</strong>. High-level process map produced as a result of project selection and definition.</p>
<p><strong> </strong></p>
<p><strong>Skewed &#8211; </strong>Data that is asymmetrical about the mean and is not normally distributed.</p>
<p><strong> </strong></p>
<p><strong>Special Cause Variation &#8211; </strong>An instance or event that impacts the process variation only under special circumstances in which the circumstance can be clearly identified, or an anomaly that is not part of the normal everyday variation encountered in the process. Sometimes referred to as unexpected variation.</p>
<p><strong> </strong></p>
<p><strong>Stability &#8211; </strong>Stability is the amount of variability in the bias over time. It indicates the extent to which a measurement remains constant and predictable over time with respect to accuracy and precision.</p>
<p><strong> </strong></p>
<p><strong>Standard Deviation &#8211; </strong>A measure of variability which describes the spread in a set of data. It is approximately the average deviation of a single data point from the mean of that data set.</p>
<p><strong> </strong></p>
<p><strong>Standard Normal Distribution &#8211; </strong>A special case of the normal distribution in which the mean equals zero and standard deviation equals one.</p>
<p><strong> </strong></p>
<p><strong>Target I-MR Chart &#8211; </strong>A type of Control Chart for variable data that plots individual data as a difference from a target and the moving range of the present and previous individual differences.</p>
<p><strong> </strong></p>
<p><strong>Target Proportion Value &#8211; </strong>A ratio, usually pertaining to a population, that is tested using a sample proportion.</p>
<p><strong> </strong></p>
<p><strong>Target Value &#8211; </strong>A value, usually a population parameter such as mean, standard deviation or proportion, that is tested using a sample.</p>
<p><strong> </strong></p>
<p><strong>T-Distribution &#8211; </strong>A symmetrical sampling distribution used to determine the confidence interval for means and to perform hypothesis tests on means.</p>
<p><strong> </strong></p>
<p><strong>Test of Association &#8211; </strong>A type of Chi-Square Hypothesis Test that examines the hypothesis of association (non-independence) between attribute variables. This procedure is used to test if the probabilities of items or subjects being classified for one variable depend upon the classification of the other variables.</p>
<p><strong> </strong></p>
<p><strong>Test of Independence &#8211; </strong>A type of Chi-Square Hypothesis Test that examines the hypothesis of association (non-independence) between attribute variables. This procedure is used to test if the probabilities of items or subjects being classified for one variable depend upon the classification of the other variables.</p>
<p><strong> </strong></p>
<p><strong>Test of One Standard Deviation Against a Constant &#8211; </strong>A type of hypothesis test for standard deviation that tests the standard deviation of a population (also known as target standard deviation) using the standard deviation of a sample.</p>
<p><strong> </strong></p>
<p><strong>Test of Several Proportions &#8211; </strong>A type of hypothesis test for proportions that examines whether or not two or more sample proportions of certain characteristics are independent of each other.</p>
<p><strong> </strong></p>
<p><strong>Test of Three or More Variances &#8211; </strong>A type of hypothesis test for standard deviation that examines the variance between three or more samples.</p>
<p><strong> </strong></p>
<p><strong>Test of Two Variances &#8211; </strong>A type of hypothesis test for standard deviation that examines the variance between two samples.</p>
<p><strong> </strong></p>
<p><strong>Test Statistic &#8211; </strong>A quantity calculated from a sample of data. The purpose of the test statistic is to obtain a p-value in a hypothesis test.</p>
<p><strong> </strong></p>
<p><strong>Throughput Yield &#8211; </strong>Throughput Yield (YTP) is a measure of process performance at an opportunity. It is a measure of the probability of that opportunity being correct with no rework.</p>
<p><strong> </strong></p>
<p><strong>Tolerance &#8211; </strong>The difference between the Upper Specification Limit (USL) and the Lower Specification Limit (LSL).</p>
<p><strong> </strong></p>
<p><strong>Traditional Yield &#8211; </strong>The ratio of number of units that ultimately pass through the entire process to the number of units that enter into the process. Equivalent to Final Yield, which equals output divided by input.</p>
<p><strong> </strong></p>
<p><strong>Treatment Combination &#8211; </strong>A specific combination of factor levels for each factor being tested in a Design of Experiments (DOE).</p>
<p><strong> </strong></p>
<p><strong>Trend &#8211; </strong>Gradual shift of data points in one direction.</p>
<p><strong> </strong></p>
<p><strong>Two-Sided Test &#8211; </strong>Hypothesis test in which the null hypothesis is rejected because the compared values are not equal to one another, regardless of whether one is smaller (or larger) than the other. For example, the alternative hypothesis for a Two-Sided Test will merely say p1 is not equal to p2, without any consideration for which is smaller (or larger).</p>
<p><strong> </strong></p>
<p><strong>Type I Error &#8211; </strong>The result of wrongly rejecting the null hypothesis.</p>
<p><strong> </strong></p>
<p><strong>Type II Error &#8211; </strong>The result of not rejecting the null hypothesis when it should have been rejected.</p>
<p><strong> </strong></p>
<p><strong>U Chart &#8211; </strong>A graphical display of number of defects per subgroup (part or unit) sampled with a constant or variable sample size so as to detect special causes.</p>
<p><strong> </strong></p>
<p><strong>UCL &#8211; </strong>See Upper Control Limit.</p>
<p><strong> </strong></p>
<p><strong>Uniform Distribution &#8211; </strong>Uniform distribution is a continuous probability density function, which is constant over an interval (say from &#8220;a&#8221; to &#8220;b&#8221;) and zero outside that interval. For example, while throwing a fair die, the probability of getting any number from 1 to 6 is the same (1/6) but the probability of getting a number less than 1 or more than 6 is none.</p>
<p><strong> </strong></p>
<p><strong>Upper Control Limit &#8211; </strong>The Upper Control Limit (UCL) is a limit in the Control Chart, set above the centerline, typically at +3 standard deviations, in order to determine if the process is in or out of control.</p>
<p><strong> </strong></p>
<p><strong>USL &#8211; Upper Specification Limit &#8211; </strong>The Upper Specification Limit (LSL) is the upper limit of a tolerance range specified by a customer requirement. A measured value that is above this limit is considered a defect.</p>
<p><strong> </strong></p>
<p><strong>Variable Data &#8211; </strong>Sometimes used in place of the term continuous data. Variable data is quantitative data that can be divided to record values at different levels of magnitude. Examples include time, distance and weight. Note: This definition is specific to Control Charts, as the data type is referred to as continuous data in other phases of the DMAIC methodology.</p>
<p><strong> </strong></p>
<p><strong>Variance &#8211; </strong>The spread of a set of data points, measured as the average of the squared differences between the data points and their mean. Also, the square of the standard deviation.</p>
<p><strong> </strong></p>
<p><strong>Variation &#8211; </strong>Used in reference to how data points are plotted with respect to a target value. Variation is commonly thought of as the measure of spread represented in a data set.</p>
<p><strong> </strong></p>
<p><strong>Variation Component Study &#8211; </strong>A designed experiment that allows one to partition sources of variability.</p>
<p><strong> </strong></p>
<p><strong>Versions &#8211; </strong>Iterations of a process map.</p>
<p><strong> </strong></p>
<p><strong>VOC &#8211; Voice of the Customer &#8211; </strong>Voice of the Customer (VOC) encompasses comments made by customers that describe how they feel about a company&#8217;s products, services and/or transactions. These comments typically have no associated measurable standards. The company&#8217;s responsibility is to communicate with its customers to identify these measurable standards, so that requirements can be met. These measurable standards become Customer Specification Limits.</p>
<p><strong> </strong></p>
<p><strong>VOP &#8211; Voice of the Process &#8211; </strong>Voice of the Process (VOP) encompasses the entire range of the output (Y) of a process when all of the X&#8217;s in the Y = f(x) equation have varied their full range.</p>
<p><strong> </strong></p>
<p><strong>X -</strong>Process Input Variable (PIV).</p>
<p><strong> </strong></p>
<p><strong>Y &#8211; </strong>Process Output Variable (POV).</p>
<p><strong> </strong></p>
<p><strong>Y = f(x) &#8211; </strong>The mathematical relationship that identifies the inputs (X&#8217;s) that need to be controlled and the levels they need to be set at to achieve the desired output (Y) to meet customer requirements.</p>
<p><strong> </strong></p>
<p><strong>Yield &#8211; </strong>The ratio of the number of units that meet certain criteria to the number of units that enter into the process. The criteria determines if a yield is Traditional Yield or another kind of yield.</p>
<p><strong> </strong></p>
<p><strong>YF &#8211; </strong>Final Yield.</p>
<p><strong> </strong></p>
<p><strong>YNORM &#8211; </strong>See Normalized Yield.</p>
<p><strong> </strong></p>
<p><strong>YRT &#8211; </strong>See Rolled Throughput Yield.</p>
<p><strong> </strong></p>
<p><strong>YTP &#8211; </strong>See Throughput Yield.</p>
<p><strong> </strong></p>
<p><strong>Z-Distribution &#8211; </strong>A special case of the normal distribution in which the mean equals zero and the standard deviation equals one.</p>
<p><strong> </strong></p>
<p><strong>Z-Score &#8211; </strong>Standardized score for measuring process performance relative to customer requirements.</p>
<p><strong> </strong></p>
<p><strong>Z-Table &#8211; </strong>A statistical table of probabilities associated with the Z-distribution or standard normal distribution.</p>
<p><strong> </strong></p>
<p><strong>Z-Transform &#8211; </strong>An equation that transfers a set of data represented by a normal distribution into a standard normal distribution (mean equals zero and standard deviation equals one) by subtracting the original mean from the data set and dividing the result by the original standard deviation.</p>
<p><strong> </strong></p>
<p><strong>2 Sigma</strong> &#8211; 308,537DPMO</p>
<p><strong> </strong></p>
<p><strong>3 Sigma &#8211; </strong>66,807DPMO</p>
<p><strong> </strong></p>
<p><strong>4 Sigma</strong> &#8211; 6,210DPMO</p>
<p><strong> </strong></p>
<p><strong>5 Sigma</strong> &#8211; 233DPMO</p>
<p><strong> </strong></p>
<p><strong>6 Sigma &#8211; </strong>3.4DPMO</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/08/24/normality-assumption-explained/">Normality Assumption Explained</a> <span class="related-post-date timestamp">Mon 24 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/22/equal-variance-assumption-explained/">Equal Variance Assumption Explained</a> <span class="related-post-date timestamp">Sat 22 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/15/introduction-to-dfss-design-for-six-sigma/">Introduction to DFSS Design for Six Sigma</a> <span class="related-post-date timestamp">Thu 15 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/01/six-sigma-brown-belt-and-other-self-declared-six-sigma-belt-colors/">Six Sigma Brown Belt and Other Self Declared Six Sigma Belt Colors</a> <span class="related-post-date timestamp">Tue 01 Sep 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/01/introduction-to-msa-measurement-systems-analysis/">Introduction to MSA Measurement Systems Analysis</a> <span class="related-post-date timestamp">Tue 01 Sep 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/28/six-sigma-core-concepts/">Six Sigma Core Concepts</a> <span class="related-post-date timestamp">Fri 28 Aug 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/08/26/six-sigma-glossary-definitions/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>101 Things Every Six Sigma Black Belt Should Know</title>
		<link>http://sixsigmaz.com/2009/08/24/101-things-every-six-sigma-black-belt-should-know/</link>
		<comments>http://sixsigmaz.com/2009/08/24/101-things-every-six-sigma-black-belt-should-know/#comments</comments>
		<pubDate>Tue, 25 Aug 2009 01:39:05 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[chi-square]]></category>
		<category><![CDATA[control chart]]></category>
		<category><![CDATA[copq]]></category>
		<category><![CDATA[Cpk]]></category>
		<category><![CDATA[erlang]]></category>
		<category><![CDATA[F-test]]></category>
		<category><![CDATA[fmea]]></category>
		<category><![CDATA[gage r&r]]></category>
		<category><![CDATA[histogram]]></category>
		<category><![CDATA[latin square]]></category>
		<category><![CDATA[mtbf]]></category>
		<category><![CDATA[ppk]]></category>
		<category><![CDATA[ppm]]></category>
		<category><![CDATA[process capability]]></category>
		<category><![CDATA[qfd]]></category>
		<category><![CDATA[reliability]]></category>
		<category><![CDATA[repeatability]]></category>
		<category><![CDATA[t-test]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=146</guid>
		<description><![CDATA[In general, a Six Sigma Black Belt should be quantitatively oriented. With minimal guidance, the Six Sigma Black Belt should be able to use data to convert broad generalizations into actionable goals. The Six Sigma Black Belt should be able to make the business case for attempting to accomplish these goals. The Six Sigma Black [...]]]></description>
			<content:encoded><![CDATA[<p><!--[if gte mso 9]><xml> <w:WordDocument> <w:View>Normal</w:View> <w:Zoom>0</w:Zoom> <w:PunctuationKerning /> <w:ValidateAgainstSchemas /> <w:SaveIfXMLInvalid>false</w:SaveIfXMLInvalid> <w:IgnoreMixedContent>false</w:IgnoreMixedContent> <w:AlwaysShowPlaceholderText>false</w:AlwaysShowPlaceholderText> <w:Compatibility> <w:BreakWrappedTables /> <w:SnapToGridInCell /> <w:WrapTextWithPunct /> <w:UseAsianBreakRules /> <w:DontGrowAutofit /> </w:Compatibility> <w:BrowserLevel>MicrosoftInternetExplorer4</w:BrowserLevel> </w:WordDocument> </xml><![endif]--><!--[if gte mso 9]><xml> <w:LatentStyles DefLockedState="false" LatentStyleCount="156"> </w:LatentStyles> </xml><![endif]--><!--[if !mso]><span class="mceItemObject"   classid="clsid:38481807-CA0E-42D2-BF39-B33AF135CC4D" id=ieooui></span><br />
<mce:style><!  st1\:*{behavior:url(#ieooui) } --></p>
<p><!--[endif]--></p>
<ol type="1">
<li class="MsoNormal">In general, a Six Sigma Black      Belt should be quantitatively oriented.</li>
<li class="MsoNormal">With minimal guidance, the      Six Sigma Black Belt should be able to use data to convert broad      generalizations into actionable goals.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to make the business case for attempting to accomplish these      goals.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to develop detailed plans for achieving these goals.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to measure progress towards the goals in terms meaningful to      customers and leaders.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know how to establish control systems for maintaining the gains achieved      through Six Sigma.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should understand and be able to communicate the rationale for continuous improvement,      even after initial goals have been accomplished.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be familiar with research that quantifies the benefits firms have      obtained from Six Sigma.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know or be able to find the PPM rates associated with different sigma      levels (e.g., Six Sigma = 3.4 PPM)</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know the approximate relative cost of poor quality associated with      various sigma levels (e.g., three sigma firms report 25% COPQ).</li>
<li class="MsoNormal">The Six Sigma Black Belt      should understand the roles of the various people involved in change (senior      leader, champion, mentor, change agent, technical leader, team leader,      facilitator).</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to design, test, and analyze customer surveys.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know how to quantitatively analyze data from employee and customer surveys.      This includes evaluating survey reliability and validity as well as the      differences between surveys.</li>
<li class="MsoNormal">Given two or more sets of      survey data, the Six Sigma Black Belt should be able to determine if there      are statistically significant differences between them.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to quantify the value of customer retention.</li>
<li class="MsoNormal">Given a partly completed QFD      matrix, the Six Sigma Black Belt should be able to complete it.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute the value of money held or invested over<br />
time, including present value and future value of a fixed sum.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute present value and future value for various compounding      periods.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute the break even point for a project.</li>
<li class="MsoNormal">The Six Sigma Black Belt should      be able to compute the net present value of cash flow streams, and to use      the results to choose among competing projects.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute the internal rate of return for cash flow streams      and to use the results to choose among competing projects.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know the COPQ rationale for Six Sigma, i.e., he should be able to      explain what to do if COPQ analysis indicates that the optimum for a given      process is less than Six Sigma.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know the basic COPQ categories and be able to allocate a list of      costs to the correct category.</li>
<li class="MsoNormal">Given a table of COPQ data      over time, the Six Sigma Black Belt should be able to perform a      statistical analysis of the trend.</li>
<li class="MsoNormal">Given a table of COPQ data      over time, the Six Sigma Black Belt should be able to perform a      statistical analysis of the distribution of costs among the various      categories.</li>
<li class="MsoNormal">Given a list of tasks for a      project, their times to complete, and their precedence relationships, the      Six Sigma Black Belt should be able to compute the time to completion for      the project, the earliest completion times, the latest completion<br />
times and the slack times. He should also be able to identify which<br />
tasks are on the critical path.</li>
<li class="MsoNormal">Give cost and time data for      project tasks, the Six Sigma Black Belt should be able to compute the cost      of normal and crash schedules and the minimum total cost<br />
schedule.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be familiar with the basic principles of benchmarking.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be familiar with the limitations of benchmarking.</li>
<li class="MsoNormal">Given an organization chart      and a listing of team members, process owners, and sponsors, the Six Sigma      Black Belt should be able to identify projects with a low probability of      success.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to identify measurement scales of various metrics (nominal,      ordinal, etc).</li>
<li class="MsoNormal">Given a metric on a      particular scale, the Six Sigma Black Belt should be able to determine if      a particular statistical method should be used for analysis.</li>
<li class="MsoNormal">Given a properly collected      set of data, the Six Sigma Black Belt should be able to perform a complete      measurement system analysis, including the calculation of<br />
bias, repeatability, reproducibility, stability, discrimination      (resolution)<br />
and linearity.</li>
<li class="MsoNormal">Given the measurement system      metrics, the Six Sigma Black Belt should know whether or not a given measurement      system should be used on a given part or process.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know the difference between computing sigma from a data set whose      production sequence is known and from a data set whose production sequence      is not known.</li>
<li class="MsoNormal">Given the results of an AIAG      Gage R&amp;R study, the Six Sigma Black Belt should be able to answer a      variety of questions about the measurement system.</li>
<li class="MsoNormal">Given a narrative description      of “as-is” and “should-be” processes, the Six<br />
Sigma Black Belt should be able to prepare process maps.</li>
<li class="MsoNormal">Given a table of raw data,      the Six Sigma Black Belt should be able to prepare a frequency tally sheet      of the data, and to use the tally sheet data to construct a<br />
histogram.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute the mean and standard deviation from a grouped frequency      distribution.</li>
<li class="MsoNormal">Given a list of problems, the      Six Sigma Black Belt should be able to construct a Pareto Diagram of the      problem frequencies.</li>
<li class="MsoNormal">Given a list which describes      problems by department, the Six Sigma Black Belt should be able to construct      a Crosstabulation and use the information to perform a Chi-square analysis.</li>
<li class="MsoNormal">Given a table of x and y data      pairs, the Six Sigma Black Belt should be able to determine if the      relationship is linear or non-linear.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know how to use non-linearity’s to make products or processes more      robust.</li>
<li class="MsoNormal">The Six Sigma Black Belt should      be able to construct and interpret a run chart when given a table of data      in time-ordered sequence. This includes calculating<br />
run length, number of runs and quantitative trend evaluation.</li>
<li class="MsoNormal">When told the data are from      an exponential or Erlang distribution the Six Sigma Black Belt should know      that the run chart is preferred over the standard X control chart.</li>
<li class="MsoNormal">Given a set of raw data the      Six Sigma Black Belt should be able to identify and compute two statistical      measures each for central tendency, dispersion, and shape.</li>
<li class="MsoNormal">Given a set of raw data, the      Six Sigma Black Belt should be able to construct a histogram.</li>
<li class="MsoNormal">Given a stem &amp; leaf plot,      the Six Sigma Black Belt should be able to reproduce a sample of numbers      to the accuracy allowed by the plot.</li>
<li class="MsoNormal">Given a box plot with numbers      on the key box points, the Six Sigma Black Belt should be able to identify      the 25th and 75th percentile and the median.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know when to apply enumerative statistical methods, and when not      to.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know when to apply analytic statistical methods, and when not to.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should demonstrate a grasp of basic probability concepts, such as the      probability of mutually exclusive events, of dependent and independent events,      of events that can occur simultaneously, etc.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know factorials, permutations and combinations, and how to use      these in commonly used probability distributions.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute expected values for continuous and discrete random      variables.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute univariate statistics for samples.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to compute confidence intervals for various statistics.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to read values from a cumulative frequency ogive.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be familiar with the commonly used probability distributions, including:      hypergeometric, binomial, Poisson, normal, exponential,<br />
chi-square, Student’s t, and F.</li>
<li class="MsoNormal">Given a set of data the Six      Sigma Black Belt should be able to correctly identify which distribution should      be used to perform a given analysis, and to use the distribution to      perform the analysis.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know that different techniques are required for analysis depending on      whether a given measure (e.g., the mean) is assumed known or estimated from      a sample. The Six Sigma Black Belt should choose and properly use the      correct technique when provided with data and sufficient information about      the data.</li>
<li class="MsoNormal">Given a set of subgrouped      data, the Six Sigma Black Belt should be able to select and prepare the      correct control charts and to determine if a given process is<br />
in a state of statistical control.</li>
<li class="MsoNormal">The above should be      demonstrated for data representing all of the most common control charts.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should understand the assumptions that underlie ANOVA, and be able to      select and apply a transformation to the data.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to identify which cause on a list of possible causes will      most likely explain a non-random pattern in the regression residuals.</li>
<li class="MsoNormal">If shown control chart      patterns, the Six Sigma Black Belt should be able to match the control      chart with the correct situation (e.g., an outlier pattern vs. a gradual<br />
trend matched to a tool breaking vs. a machine gradually warming up).</li>
<li class="MsoNormal">The Six Sigma Black Belt      should understand the mechanics of PRE-Control.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should be able to correctly apply EWMA charts to a process with serial correlation      in the data.</li>
<li class="MsoNormal">Given a stable set of      subgrouped data, the Six Sigma Black Belt should be able to perform a      complete Process Capability Analysis. This includes computing and      interpreting capability indices, estimating the % failures, control limit      calculations, etc.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should demonstrate an awareness of the assumptions that underlie the use      of capability indices.</li>
<li class="MsoNormal">Given the results of a      replicated 22 full-factorial experiment, the Six Sigma Black Belt should      be able to complete the entire ANOVA table.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should understand the basic principles of planning a statistically designed      experiment. This can be demonstrated by critiquing various<br />
experimental plans with or without shortcomings.</li>
<li class="MsoNormal">Given a “clean” experimental      plan, the Six Sigma Black Belt should be able to find the correct number      of replicates to obtain a desired power.</li>
<li class="MsoNormal">The Six Sigma Black Belt      should know the difference between the various types of experimental models      (fixed-effects, random-effects, mixed).</li>
<li class="MsoNormal">The Six Sigma Black Belt      should understand the concepts of randomization and blocking.</li>
<li class="MsoNormal">Given a set of data, the Six      Sigma Black Belt should be able to perform a Latin Square analysis and interpret      the results.</li>
<li class="MsoNormal">Ditto for one way ANOVA, two      way ANOVA (with and without replicates), full and fractional factorials, and      response surface designs.</li>
<li class="MsoNormal">Given an appropriate      experimental result, the Six Sigma Black Belt should be able to compute      the direction of steepest ascent.</li>
<li class="MsoNormal">Given a set of variables each      at two levels, the Six Sigma Black Belt can determine the correct experimental      layout for a screening experiment using a saturated design.</li>
<li class="MsoNormal">Given data for such an      experiment, the Six Sigma Black Belt can identify which main effects are      significant and state the effect of these factors.</li>
<li class="MsoNormal">Given two or more sets of      responses to categorical items (e.g., customer survey responses      categorized as poor, fair, good, excellent), the Six Sigma Black Belt will      be able to perform a Chi-Square test to determine if the samples are<br />
significantly different.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      understand the idea of confounding and be able to identify which two      factor interactions are confounded with the significant main effects.</li>
<li class="MsoNormal">The Six Sigma Black Beltwill      be able to state the direction of steepest ascent from experimental data.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      understand fold over designs and be able to identify the fold over design      that will clear a given alias.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      know how to augment a factorial design to create a composite or central      composite design.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to evaluate the diagnostics for an experiment.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to identify the need for a transformation in y and to apply the      correct transformation.</li>
<li class="MsoNormal">Given a response surface      equation in quadratic form, the Six Sigma Black Belt will be able to      compute the stationary point.</li>
<li class="MsoNormal">Given data (not graphics),      the Six Sigma Black Belt will be able to determine if the stationary point      is a maximum, minimum or saddle point.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to use a quadratic loss function to compute the cost of a given      process.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to conduct simple and multiple linear regression.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to identify patterns in residuals from an improper regression model      and to apply the correct remedy.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      understand the difference between regression and correlation studies.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to perform chi-square analysis of contingency tables.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to compute basic reliability statistics (mtbf, availability, etc.).</li>
<li class="MsoNormal">Given the failure rates for      given subsystems, the Six Sigma Black Belt will be able to use reliability      apportionment to set mtbf goals.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      be able to compute the reliability of series, parallel, and      series-parallel system configurations.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      demonstrate the ability to create and read an FMEA analysis.</li>
<li class="MsoNormal">The Six Sigma Black Belt will      demonstrate the ability to create and read a fault tree.</li>
<li class="MsoNormal">Given distributions of      strength and stress, the Six Sigma Black Belt will be able to compute the      probability of failure.</li>
</ol>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;"><!--[if !supportLists]--><span>100.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal; -x-system-font: none;"> </span></span><!--[endif]-->The Six Sigma Black Belt will be able to apply statistical tolerancing to set tolerances for<br />
simple assemblies. He will know how to compare statistical tolerances<br />
to so-called “worst case” tolerancing.</p>
<p class="MsoNormal" style="margin-left: 0.5in; text-indent: -0.25in;">101. The Six Sigma Black Belt will      be aware of the limits of the Six Sigma approach.</p>
<p class="MsoNormal"><a href="http://sixsigmaz.com/wp-content/uploads/2009/08/six-sigma-black-belt.png" rel="lightbox[146]"><img class="alignright size-thumbnail wp-image-151" title="six sigma black belt" src="http://sixsigmaz.com/wp-content/uploads/2009/08/six-sigma-black-belt-150x150.png" alt="six sigma black belt" width="150" height="150" /></a></p>
<p style="text-align: center;" align="center">Copyright © 2003<br />
by Thomas Pyzdek, all rights reserved</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/">Six Sigma and Baseball</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/15/top-10-six-sigma-black-belt-excuses/">Top 10 Six Sigma Black Belt Excuses</a> <span class="related-post-date timestamp">Thu 15 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/14/measurement-system-analysis-for-beginners/">Measurement System Analysis for Beginners</a> <span class="related-post-date timestamp">Wed 14 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-glossary-definitions/">Six Sigma Glossary & Definitions</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/24/normality-assumption-explained/">Normality Assumption Explained</a> <span class="related-post-date timestamp">Mon 24 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/22/equal-variance-assumption-explained/">Equal Variance Assumption Explained</a> <span class="related-post-date timestamp">Sat 22 Aug 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/08/24/101-things-every-six-sigma-black-belt-should-know/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Normality Assumption Explained</title>
		<link>http://sixsigmaz.com/2009/08/24/normality-assumption-explained/</link>
		<comments>http://sixsigmaz.com/2009/08/24/normality-assumption-explained/#comments</comments>
		<pubDate>Mon, 24 Aug 2009 17:10:32 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[DMAIC]]></category>
		<category><![CDATA[SPC]]></category>
		<category><![CDATA[Anderson-Darling]]></category>
		<category><![CDATA[basic statistics]]></category>
		<category><![CDATA[central limit theorem]]></category>
		<category><![CDATA[Critical Ys]]></category>
		<category><![CDATA[CTQ]]></category>
		<category><![CDATA[descriptive statistics]]></category>
		<category><![CDATA[histrogram]]></category>
		<category><![CDATA[Normality Assumption]]></category>
		<category><![CDATA[p-value]]></category>
		<category><![CDATA[pareto]]></category>
		<category><![CDATA[regression]]></category>
		<category><![CDATA[sample population]]></category>
		<category><![CDATA[t-test]]></category>
		<category><![CDATA[z-test]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=138</guid>
		<description><![CDATA[The normality assumption is an extremely important topic in statistics, since the vast majority of statistical tools were built theoretically upon this assumption. For example, the 1-sample and 2-sample t-tests and Z-tests, along with the corresponding confidence intervals, assume that the data were sampled from populationsards. Intended Use The normality assumption is an extremely important [...]]]></description>
			<content:encoded><![CDATA[<p>The normality assumption is an extremely important topic in statistics, since the vast majority of statistical tools were built theoretically upon this assumption. For example, the 1-sample and 2-sample t-tests and Z-tests, along with the corresponding confidence intervals, assume that the data were sampled from populationsards.</p>
<p class="MsoSubtitle">
<p class="MsoSubtitle"><span style="font-size: 14pt;">Intended Use</span></p>
<p class="MsoBodyText">The normality assumption is an extremely important topic in statistics, since the vast majority of statistical tools were built theoretically upon this assumption. For example, the 1-sample and 2-sample t-tests and Z-tests, along with the corresponding confidence intervals, assume that the data were sampled from populations having normal distributions. Most linear modeling procedures, such as Regression and ANOVA, also assume that the residuals (errors) from the model are normally distributed. In addition, the most widely used control charts and process capability statistics are based upon theoretical assumptions about the normality of the process data.</p>
<p class="MsoBodyText">
<p class="MsoBodyText">Since this assumption is often a prominent part of using many statistical tools, it is often suggested that tests be run to check on the validity of this assumption. When doing so, one should always keep in mind the following points:</p>
<p class="MsoBodyText">
<p class="MsoBodyText" style="margin-left: 0.5in; text-indent: -0.25in;"><!--[if !supportLists]--><span><span>1.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span></span><!--[endif]-->Relative importance of the normality assumption.</p>
<p class="MsoBodyText" style="margin-left: 0.25in;">Most statistical tools that assume normality have additional assumptions. In the majority of cases the other assumptions are more important to the validity of the tool than the normality assumption. For example, most statistical tools are very robust to departures from normality, but it is critical that the data are collected independently.</p>
<p class="MsoBodyText" style="margin-left: 0.25in;">
<p class="MsoBodyText" style="margin-left: 0.5in; text-indent: -0.25in;"><!--[if !supportLists]--><span><span>2.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span></span><!--[endif]-->The type of non-normality.</p>
<p class="MsoBodyText" style="margin-left: 0.25in;">A normal distribution is symmetric, with a certain percentage of the data within 1 standard deviation, within 2 standard deviations, within 3 standard deviations, and so on. Departures from normality can mean that the data are not symmetric. However, this is not always the case. The data may be symmetric, but the percentages of data within certain bounds do not match those for a normal distribution. Both of these will fail a test for normality, but the second is much less serious than the first.</p>
<p class="MsoBodyText" style="margin-left: 0.25in;">
<p class="MsoBodyText" style="margin-left: 0.5in; text-indent: -0.25in;"><!--[if !supportLists]--><span><span>3.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span></span><!--[endif]-->Data transformations.</p>
<p class="MsoBodyText" style="margin-left: 0.25in;">In many cases where the data do not fit a normal distribution, transformations exist that will make the data “more normal”. The log transformation (or the Box-Cox power transformation) is very effective for skewed data. The arcsin transformation can be used for binomial proportions.</p>
<p class="MsoNormal" style="padding-left: 30px;"><span style="font-size: 10pt; font-family: Arial;"> </span><span style="font-size: 10pt; font-family: Arial;">4.  The      effects of non-normality </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">In all cases, non-normality affects the probability of making a wrong decision, whether it be rejecting the null hypothesis when it is true (Type I error) or accepting the null hypothesis when it is false (Type II error). The p-value (probability of making a Type I error) associated with most statistical tools is underestimated when the assumption of normality is violated. In other words, the true p-value is somewhat larger than the reported p-value.</span><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal" style="padding-left: 30px;"><span style="font-size: 10pt; font-family: Arial;">5. The      impact of sample size. </span></p>
<p class="MsoNormal" style="margin-left: 0.25in;"><span style="font-size: 10pt; font-family: Arial;">For large samples (n &gt;= 25), the effects of non-normality on the probabilities of making errors are minimized, due to the Central Limit Theorem. Sample size also affects the procedures you use to test for normality, which can be very erratic for small samples.</span></p>
<p class="MsoNormal" style="padding-left: 30px;"><span style="font-size: 10pt; font-family: Arial;"> </span><span style="font-size: 10pt; font-family: Arial;">6. Normality      is assumed for the <span style="text-decoration: underline;">population</span>, not the <span style="text-decoration: underline;">sample</span>. </span></p>
<p class="MsoNormal" style="margin-left: 0.25in;"><span style="font-size: 10pt; font-family: Arial;">When we sample data from a population, we often collect small amounts of measurements. Before we test this small set of measurements, we should consider whether we have ever collected data from this population before and, if so, include all the data in the assessment of normality.</span></p>
<p class="MsoNormal" style="margin-left: 0.25in;"><span style="font-size: 10pt; font-family: Arial;">7. Parametric      tests vs non-parametric tests.</span></p>
<p class="MsoNormal" style="margin-left: 0.25in;"><span style="font-size: 10pt; font-family: Arial;">There are many non-parametric alternatives to standard statistical tests. However, it should be noted these also assume that the underlying distributions are symmetric. Most standard statistical tools that assume normality will also work fine if the data are symmetric. Also, consider that the power of the non-parametric tests only approaches the power of the standard parametric tests when the sample size is very large. So, given the choice between the two, if the data are fairly symmetric, the standard parametric tests are better choices than the non-parametric alternatives.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"><span> </span></span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">What is the bottom line? If you use a statistical tool that assumes normality, you can and probably should test this assumption. Use a normal probability plot, a statistical test, AND a histogram. Never use a statistical test for normality without also using either a histogram (best choice) or a probability plot. If the test for normality fails, check for symmetry with the histogram or probability plot. If you have a large sample, the data would have to be extremely skewed before there is cause for concern. For skewed data, try a log (or power) transformation. With small samples, you are taking chances with any tool, including the probability plot, the test for normality and the histogram. Also, non-parametric alternatives have little discriminatory power with small samples. If you use a statistical tool that assumes normality, and the test fails, remember that the p-value you see will be smaller than it actually should be. This is only cause for concern when the p-value is marginally significant. You might want to run the non-parametric alternative (if there is one) and see if the results agree. And, consider the practical significance as well as the statistical significance.</span></p>
<h1><span style="font-size: 14pt;"> </span></h1>
<h1><span style="font-size: 14pt;">How it Fits With the Breakthrough Strategy (DMAIC)</span></h1>
<p class="MsoBodyText"><span style="font-family: Arial;"> </span>The Normality Assumption is assumed for various tools that are utilized in the following phases of the Breakthrough Strategy,</p>
<p class="MsoBodyText" style="margin-left: 0.5in;"><strong><a href="http://templatestaff.com/quality-assurance-concepts-articles-menu-60/3-normality-assumption#_Measure_Phase">Measure Phase</a></strong></p>
<p class="MsoBodyText" style="margin-left: 0.5in;"><strong><a href="http://templatestaff.com/quality-assurance-concepts-articles-menu-60/3-normality-assumption#_Analyze_Phase">Analyze Phase</a></strong></p>
<p class="MsoBodyText" style="margin-left: 0.5in;"><strong><a href="http://templatestaff.com/quality-assurance-concepts-articles-menu-60/3-normality-assumption#_Analyze_Phase">Improve Phase</a></strong></p>
<p class="MsoBodyText" style="margin-left: 0.5in;"><strong><a href="http://templatestaff.com/quality-assurance-concepts-articles-menu-60/3-normality-assumption#_Control_Phase">Control Phase</a></strong></p>
<p class="MsoNormal"><span style="font-family: Arial;"> </span></p>
<h1>Measure Phase</h1>
<p class="MsoNormal"><span style="font-family: Arial;"> </span><span style="font-size: 10pt; font-family: Arial;">In the Measure Phase of a Black Belt project, the Black Belt must perform an initial capability study to obtain a baseline for the process. This capability study should incorporate control charts to verify that the process was stable, as well as capability indices, which gage its performance. Both the control charts and the capability statistics are theoretically founded upon an assumption of normality. </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">As for control charts, the effects of non-normality show up in the probabilities associated with falsely failing the tests for special causes. It has been shown (Wheeler) that these effects are minimal for Xbar charts, even for very small (n &lt; 5) subgroup sizes, provided the data are not extremely skewed. Even in cases of extremely skewed data, the effects are minimal if the subgroup size is 5 or more. </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Process capability indices themselves do not assume normality – they are simply ratios of distances. However, if one makes inferences about the defect rate based upon these indices, these inferences can be in error if the data are extremely skewed. One fortunate aspect of process capability studies is that it is standard practice to base them upon at least 100 observations, therefore lessening the impact of the normality assumption.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"><span> </span></span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Minitab’s capability study shows a histogram of the sample data, along with capability indices, which are used to estimate process performance. The histogram has a normal curve superimposed over it to make it easier to assess the normality of the data. Minitab also has a nice facility for finding the optimum power transformation for use with data in subgroups. For cases with heavily skewed data, Minitab will also calculate process capability assuming a Weibull distribution instead of a normal distribution. </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"><br />
</span></p>
<h1><a title="_Analyze_Phase" name="_Analyze_Phase"></a>Analyze Phase</h1>
<p class="MsoNormal"><span style="font-family: Arial;"> </span><span style="font-size: 10pt; font-family: Arial;">In the Analyze Phase of a Black Belt project, the Black Belt must isolate variables which exert leverage on the CTQ. These leverage variables are uncovered through the use of various statistical tools designed to detect differences in means, differences in variances, patterns in means, or patterns in variances, in the case where the CTQ is continuous.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Most of these tools assume normality. For many, there are non-parametric alternatives. As mentioned earlier, one should use good sense and judgement when deciding which test is more appropriate. It is a good idea to test for normality. If the test fails, check the symmetry of the data. With a large (n &gt;= 25) sample, the parametric tools are better choices than the non-parametric alternatives, unless the data are excessively skewed.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoBodyText">For example, the 1-sample and 2-sample t-test (and Z-test) also assume that the samples are independent, that the measurement scale is continuous, and that the variances (for the 2-sample case) are equal. Of this set of assumptions, the least important is the assumption of normality, since these tests are very robust to the assumption of normality, provided the samples are large and that the underlying distribution(s) is (are) fairly symmetric. It is interesting to note that the non-parametric alternatives to the t and Z tests also assume that the underlying distributions are symmetric.</p>
<p class="MsoBodyText">
<p class="MsoBodyText">Similarly, linear modeling procedures such as ANOVA and Regression also assume that the residuals (errors) are independent, identically distributed, with a continuous measurement scale. Of these assumptions, the least important is again the assumption of normality. Both ANOVA and Regression are very robust to this assumption, provided the underlying distribution of the residuals is fairly symmetric.</p>
<p class="MsoBodyText">
<p class="MsoBodyText">When using these tools, don’t forget to try data transformations, especially when the data are skewed. Keep in mind that the p-values you see may be underestimated if the data are not from a normal distribution. The effect on the p-values is minimized for large samples. If a standard parametric test is used and the reported p-value is marginally significant, then the actual p-value may be marginally insignificant. When using ANY statistical tool, one should ALWAYS consider the practical significance of the result as well as the statistical significance of the result before passing final judgement.</p>
<p class="MsoBodyText">
<h1><a title="_Improve_Phase" name="_Improve_Phase"></a>Improve Phase</h1>
<p class="MsoBodyText">In the Improve Phase, the Black Belt will often use designed experiments to make dramatic improvements in the performance of the CTQ. A designed experiment is a procedure for simultaneously altering all of the leverage variables discovered in the Analyze Phase and observing what effects these changes have on the CTQ. The Black Belt must determine exactly which leverage variables are critical to improving the performance of the CTQ, and establish settings for those critical variables.</p>
<p class="MsoBodyText">
<p class="MsoBodyText">In order to determine whether an effect from a leverage variable, or an interaction between 2 or more leverage variables, is statistically significant, the Black Belt will often utilize an ANOVA table, or a Pareto chart of effects, or a normal plot of effects. The ANOVA table displays p-values for assessing the statistical significance of model components. The Pareto chart of effects has a cutoff line to show which effects are statistically significant. The normal plot of effects is another tool for judging which effects are statistically significant.</p>
<p class="MsoBodyText">
<p class="MsoBodyText">All of these methods assume that the residuals after fitting the model are from a normal distribution. Keep in mind that these methods are pretty robust to non-normal data, but it would still be wise to check a histogram of the residuals to be sure there are no extreme departures from normality, or more importantly, are not excessively skewed. If the data are heavily skewed, use a transformation. Also, bear in mind that the p-values in the ANOVA table may be slightly underestimated, and that the cutoff line in the Pareto chart is slightly higher than it should be. In other words, some effects that are observed to be marginally statistically significant could actually be marginally insignificant. In any case where an effect is marginally significant, or marginally insignificant, from a statistical point of view, one should always ask whether it is of practical significance before passing final judgement.</p>
<p class="MsoNormal"><span style="font-family: Arial;"> </span></p>
<h1><a title="_Control_Phase" name="_Control_Phase"></a>Control Phase</h1>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">In the Control Phase, the Black Belt devises a control plan for each of the critical leverage variables from the Improve Phase. These are the variables which truly drive the relationship Y = f(X1,X2,…,Xn). In order to maintain the gains in performance for the CTQ, or Y, the X variables must be controlled. Part of the control plan is a sampling scheme for each X, coupled with a control chart to detect whether each X is staying on target.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Control charts for continuous data assume the data are from a normal distribution, although control charts have been shown to be very robust to the assumption of normality, in particular the Xbar chart. A simulation study found in Wheeler shows that even for subgroups of size 3, the Xbar chart is robust to non-normality except for excessively skewed data. For subgroups of size 5, the Xbar chart is robust even if the underlying data are excessively skewed. </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">If the I chart (subgroup size = 1) is used, the effects of non-normality will be seen in an elevated rate of false alarms from the tests for special causes. These tests are designed to have low probabilities of false alarms, based upon a normal distribution. Depending upon the type of departure from normality, certain tests will exhibit higher false alarm rates.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"> </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">As with the tools mentioned earlier, you can transform the data, which is particularly effective for skewed data. Minitab has a facility for selecting the optimum power transformation to use with data in subgroups. Use this feature for skewed data.</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;"><span> </span></span></p>
<h1><a title="_Anatomy_of_a" name="_Anatomy_of_a"></a><span style="font-size: 14pt;"> </span></h1>
<h1><span style="font-size: 14pt;">Considerations for the Normality Assumption</span></h1>
<p class="MsoBodyText">The use of statistical tools does not follow some exact cookbook. By their very nature, there is always an element of error associated with these tools. The same is true about the assumption of normality. No one can make the claim that they have ever collected data from an exact normal distribution, because it does not exist. However, many naturally occurring phenomena follow a very close approximate normal distribution. So, when assessing the validity of the normality assumption, keep in mind the points outlined earlier:</p>
<p class="MsoBodyText">
<p><!--[if !supportLists]--><!--[endif]--><span><span>1.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span></span>Relative importance of the normality assumption.</p>
<p><!--[if !supportLists]--><!--[endif]--><span><span>2.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span></span>The type of non-normality.</p>
<p><!--[if !supportLists]--><!--[endif]--><span><span>3.<span style="font-family: &quot;Times New Roman&quot;; font-style: normal; font-variant: normal; font-weight: normal; font-size: 7pt; line-height: normal; font-size-adjust: none; font-stretch: normal;"> </span></span></span>Data transformations.</p>
<p><span style="font-size: 10pt; font-family: Arial;">4. The      effects of non-normality.</span></p>
<p><span style="font-size: 10pt; font-family: Arial;">5.  The      impact of sample size. </span><br />
<span style="font-size: 10pt; font-family: Arial;">6.  Normality      is assumed for the <span style="text-decoration: underline;">population</span>, not the <span style="text-decoration: underline;">sample</span>. </span><br />
<span style="font-size: 10pt; font-family: Arial;">7.  Parametric      tests vs non-parametric tests.</span></p>
<p><span style="font-size: 10pt; font-family: Arial;"><br />
</span></p>
<p class="MsoNormal"><span style="font-size: 14pt;">Cookbook for Checking the Normality Assumption</span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Minitab has several tools for assessing normality. The statistical tests for normality should never be used by themselves &#8211; they should always be accompanied by either a probability plot or a histogram. </span></p>
<p class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">To check normality for a sample of data</span></p>
<ol style="margin-top: 0in;">
<li class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Go to      the Stat menu, select Basic Statistics.</span></li>
<li class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Select      Display Descriptive Statistics </span></li>
<li class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">In the Variables box, enter the columns you want to check for normality. Note, each column you enter will generate its own display. </span></li>
<li class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Click      the Graphs button.</span></li>
<li class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">Check      the box for Graphical Summary, then click Okay. </span></li>
<li class="MsoNormal"><span style="font-size: 10pt; font-family: Arial;">You will see a display as shown below. Check the p-value associated with the Anderson-Darling test for normality. Also check the histogram for symmetry in the data.</span></li>
</ol>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/08/22/equal-variance-assumption-explained/">Equal Variance Assumption Explained</a> <span class="related-post-date timestamp">Sat 22 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/">Six Sigma and Minitab A complete Toolbox Guide for All Six Sigma Practitioners</a> <span class="related-post-date timestamp">Wed 11 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/">Six Sigma and Baseball</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-glossary-definitions/">Six Sigma Glossary & Definitions</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/15/introduction-to-dfss-design-for-six-sigma/">Introduction to DFSS Design for Six Sigma</a> <span class="related-post-date timestamp">Thu 15 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/09/21/introduction-to-design-of-experiments/">Introduction to Design of Experiments</a> <span class="related-post-date timestamp">Mon 21 Sep 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/08/24/normality-assumption-explained/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Equal Variance Assumption Explained</title>
		<link>http://sixsigmaz.com/2009/08/22/equal-variance-assumption-explained/</link>
		<comments>http://sixsigmaz.com/2009/08/22/equal-variance-assumption-explained/#comments</comments>
		<pubDate>Sun, 23 Aug 2009 01:55:24 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[ANOVA]]></category>
		<category><![CDATA[CTQ]]></category>
		<category><![CDATA[DMAIC]]></category>
		<category><![CDATA[pareto]]></category>
		<category><![CDATA[t-test]]></category>

		<guid isPermaLink="false">http://sixsigmaz.com/?p=21</guid>
		<description><![CDATA[The equal variance assumption is important in statistics because it applies to two of the most widely used tools, the two-sample t-test, and Analysis of Variance (ANOVA). Since this assumption plays a prominent role when utilizing these popular tools, it is often suggested that tests be run to check on the validity of this assumption. [...]]]></description>
			<content:encoded><![CDATA[<p>The equal variance assumption is important in statistics because it applies to two of the most widely used tools, the two-sample t-test, and Analysis of Variance (ANOVA).<br />
Since this assumption plays a prominent role when utilizing these popular tools, it is often suggested that tests be run to check on the validity of this assumption. When doing so, one should always keep in mind the following points:</p>
<p>1. Robustness when samples sizes are equal.</p>
<p>Both the two-sample t-test and ANOVA are very robust to the equal variance assumption when the sample sizes are equal, or nearly equal.</p>
<p>2.  Alternative procedures.</p>
<p>The two-sample t-test can be used either with or without the assumption of equal variances. In fact, the default method in Minitab does not assume equal variances.</p>
<p>3.  Data transformations.</p>
<p>If the variances of the samples are correlated with the size of the data (as Y increases, the variance of Y increases), it may be possible to use a log transformation (or the Box-Cox power transformation) to correct the problem.</p>
<p>4.  The effects of unequal variances.</p>
<p>In all cases, unequal variances affect the overall estimate of the error variance. This in turn affects the corresponding t or F statistics, which in turn affects the reported p-values. The p-value (probability of making a Type I error) is underestimated when the assumption of equal variances is violated. In other words, the true p-value is somewhat larger than the reported p-value.</p>
<p>5.  Parametric tests vs non-parametric tests.</p>
<p>There are non-parametric alternatives to both the two-sample t-test and ANOVA. However, it should be noted these also assume that the underlying distributions are symmetric, with the same shape. Stating that the distributions have the same shape is tantamount to stating that they have the same variance. Since both the parametric and the non-parametric tests assume basically the same thing, the parametric tests are preferred, since they are uniformly more powerful.</p>
<p>6.  Differences in variance are also desirable.</p>
<p>One of the major goals of every project should be to reduce the variation in the CTQ, or Y. Don’t forget that the two-sample t-test and ANOVA are both methods for detecting changes in the mean of Y. If changes in the variation are observed, these are important, perhaps more important than the changes in the mean one is testing for. These differences should be studied to determine if they are consistent.</p>
<p>What is the bottom line? If you use a statistical tool that assumes equal variance, you can and probably should test this assumption. Remember that if the sample sizes are equal, or nearly equal, this assumption can be relaxed a great deal. Also, as a rule of thumb, even when the sample sizes are not nearly equal, there is usually no problem provided the largest sample standard deviation is not more than twice the size of the smallest sample standard deviation. When there is a problem, remember that what may be a “problem” as far as testing for differences in the mean of Y can also be a “solution” for determining ways to reduce the variation in Y. Be sure to check whether the variances increase as the size of the data increases. This is not at all uncommon, and can be remedied easily with a log or Box-Cox power transformation of the data. If all else fails, remember that the p-value you see will be smaller than it actually should be. This is only cause for concern when the p-value is marginally significant. You might want to run the non-parametric alternative (if there is one) and see if the results agree. And, consider the practical significance as well as the statistical significance.</p>
<h1>How it Fits with the Breakthrough Strategy (DMAIC)</h1>
<p>The Normality Assumption is assumed for tools that are utilized in the following phases of the Breakthrough Strategy,</p>
<h1>Analyze Phase</h1>
<p>In the Analyze Phase of a Black Belt project, the Black Belt must isolate variables which exert leverage on the CTQ. These leverage variables are uncovered through the use of various statistical tools designed to detect differences in means, differences in variances, patterns in means, or patterns in variances, in the case where the CTQ is continuous.</p>
<p>Two of the prominent tools for detecting differences in means are the two-sample t-test and ANOVA. The assumptions for both of these are the same, since the two-sample t-test is just a special case of one-way ANOVA. Both of these tools also have non-parametric alternatives, at least in some cases. The t-test also has an alternative where equal variances are not assumed. There are non-parametric alternatives for one-way and two-way ANOVA. As mentioned earlier, one should use good sense and judgement when deciding which test is more appropriate.</p>
<p>It is a good idea to test for equal variances. If the test fails, check the ratio of the largest sample standard deviation to the smallest sample standard deviation. If the sample sizes are equal, or nearly equal, there should not be a problem unless this ratio is larger than 4. Even when the sample sizes are not nearly equal, there should not be a problem unless this ratio is greater than 2. And, more importantly, remember that you are using these tools to detect changes in the mean of Y, and the “problem” you are having is that you have detected changes in the variation of Y at the same time. Of course, that may help you satisfy the ultimate goal of reducing the variation of Y.</p>
<p>When using these tools, don’t forget to try data transformations, especially when the sample variances increase as the sample means increase. Keep in mind that the p-values you see may be underestimated if the variances are not equal. If a t-test or ANOVA are used and the reported p-value is marginally significant, then the actual p-value may be marginally insignificant. When using ANY statistical tool, one should ALWAYS consider the practical significance of the result as well as the statistical significance of the result before passing final judgement.</p>
<h1>Improve Phase</h1>
<p>In the Improve Phase, the Black Belt will often use designed experiments to make dramatic improvements in the performance of the CTQ. A designed experiment is a procedure for simultaneously altering all of the leverage variables discovered in the Analyze Phase and observing what effects these changes have on the CTQ. The Black Belt must determine exactly which leverage variables are critical to improving the performance of the CTQ, and establish settings for those critical variables.</p>
<p>In order to determine whether an effect from a leverage variable, or an interaction between 2 or more leverage variables, is statistically significant, the Black Belt will often utilize an ANOVA table, or a Pareto chart of effects, or a normal plot of effects. The results from all of these are based upon the estimate of the error variance, whicother words, the sample sizes are all equal. Thus, the equal variances assumption can be relaxed for balanced experiments.</p>
<p>Still, it is good practice to check for radical departures from the equal variance assumption. In order to do so you must remember one thing – it is the variances of the residuals that are assumed to be equal, not the variances of the Y values. This means that you would need to organize the residuals into groups based upon the values of the factors, one group for each term in the model, and then get an estimate of the variance from each group. For a one-way ANOVA this is not too hard to do. For a two-way ANOVA, it is still not difficult. But, for more than two factors, this becomes increasingly difficult to do. An alternative would be to examine the residuals for extreme outliers, or plot the residuals against each factor as well against the fits. These plots will at least show whether any of the factors have main effects on the variability of the residuals, as well as determining whether the residual variance is correlated with the response.</p>
<p>Also, bear in mind that the p-values in the ANOVA table may be slightly underestimated, and that the cutoff line in the Pareto chart is slightly higher than it should be. In other words, some effects that are observed to be marginally statistically significant could actually be marginally insignificant. In any case where an effect is marginally significant, or marginally insignificant, from a statistical point of view, one should always ask whether it is of practical significance before passing final judgement.</p>
<h1>Considerations for the Equal Variance Assumption</h1>
<p>The use of statistical tools does not follow some exact cookbook. By their very nature, there is always an element of error associated with these tools. The same is true about the assumption of equal variances. Since the Y variable for both the two-sample t-test and ANOVA is the same for all samples, it is not likely that the variances will differ greatly from one sample to another. On the other hand, since one of the goals of a good project is to reduce the variation in Y, extreme differences in variability should be studied to determine why they occurred. It is not uncommon to observe that the variance of Y increases as the mean of Y increases, a condition that can be easily remedied with a data transformation. So, when assessing the validity of the equal variances assumption, keep in mind the points outlined earlier:</p>
<p>1.  Robustness when samples sizes are equal.</p>
<p>2.  Alternative procedures.</p>
<p>3.  Data transformations.</p>
<p>4.  The effects of unequal variances.</p>
<p>5.  Parametric tests vs non-parametric tests.</p>
<p>6.  Changes in variation can be desirable.</p>
<p><a class="a2a_dd addtoany_share_save" href="http://www.addtoany.com/share_save"><img src="http://sixsigmaz.com/wp-content/plugins/add-to-any/share_save_171_16.png" width="171" height="16" alt="Share/Bookmark"/></a> </p><h4 class='related-posts-header'>Related Posts</h4><ul class="related-posts-list"><li class="related-post"><a href="http://sixsigmaz.com/2009/08/24/normality-assumption-explained/">Normality Assumption Explained</a> <span class="related-post-date timestamp">Mon 24 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-glossary-definitions/">Six Sigma Glossary & Definitions</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/11/11/six-sigma-and-minitab-a-complete-toolbox-guide-for-all-six-sigma-practitioners/">Six Sigma and Minitab A complete Toolbox Guide for All Six Sigma Practitioners</a> <span class="related-post-date timestamp">Wed 11 Nov 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/10/15/introduction-to-dfss-design-for-six-sigma/">Introduction to DFSS Design for Six Sigma</a> <span class="related-post-date timestamp">Thu 15 Oct 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/26/six-sigma-and-baseball/">Six Sigma and Baseball</a> <span class="related-post-date timestamp">Wed 26 Aug 2009</span></li><li class="related-post"><a href="http://sixsigmaz.com/2009/08/24/101-things-every-six-sigma-black-belt-should-know/">101 Things Every Six Sigma Black Belt Should Know</a> <span class="related-post-date timestamp">Mon 24 Aug 2009</span></li></ul>]]></content:encoded>
			<wfw:commentRss>http://sixsigmaz.com/2009/08/22/equal-variance-assumption-explained/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>
