์ „์ฒด ๊ธ€

Python, C++, Data Science ๊ณต๋ถ€ ๋ธ”๋กœ๊ทธ ์ž…๋‹ˆ๋‹ค.
When there are two numerical variables, there are  TrendPositive associationNegative association PatternAny discernible "shape" in the scatterLinear Non-linear Visualize, then quantify  The Correlation Coefficient rMeasures linear association. It is based on the standard units. r is defined as:The average of product of (x in standard units) and (y in standard units) ํ‘œ์ค€ ๋‹จ์œ„ x์™€ ํ‘œ์ค€ ๋‹จ์œ„ y์˜ ๊ณฑ์˜ ํ‰๊ท   In P..
Confidence Interval is the interval of estimates of a parameter.It's based on random sampling. ๊ฐ€์žฅ ๋„๋ฆฌ ์‚ฌ์šฉ๋˜๋Š” ๊ฒƒ์€ 95% Confidence Interval ์ž…๋‹ˆ๋‹ค. Here, '95%' is called the confidence level.it could be any percent between 0 - 100.Higher confidence level means wider intervals.  Confidence interval can be considered 'Good' if it contains the parameter.The confidence is in the process that creates the inter..
A/B testing is a type of experiment in Data Science that compare values of sampled individuals in Group A with values of sampled individuals in Group B.Q. Do the two sets of values come from the same underlying distribution?  ์˜ˆ๋ฅผ ๋“ค์–ด, ๋Œ€ํ•œ๋ฏผ๊ตญ์˜ A ์ง€์—ญ์—์„œ ์ƒ˜ํ”Œ๋ง์„ ํ†ตํ•ด ์ธก์ •ํ•œ ํ‰๊ท  ์‹ ์žฅ์ด 165cm, B ์ง€์—ญ์—์„œ๋Š” 170cm๋ผ๊ณ  ํ•ด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค. (observed statistic)์—ฌ๊ธฐ์„œ, A์™€ B ์ง€์—ญ์˜ ํ‰๊ท  ์‹ ์žฅ ์ฐจ์ด๊ฐ€ same underlying distribution (๋Œ€ํ•œ๋ฏผ๊ตญ ์ „์ฒด ์‹ ์žฅ ๋ถ„ํฌ) ์—์„œ ๋น„๋กฏ๋œ ๊ฒƒ์œผ๋กœ ํŒ๋‹จ..
Data science์—์„œ๋Š” population์˜ unknown parameter๋ฅผ estimate ํ•˜๋Š” ๊ฒƒ์ด ๋ชฉํ‘œ์ผ ๋•Œ๊ฐ€ ๋งŽ์Šต๋‹ˆ๋‹ค. ์˜ˆ๋ฅผ ๋“ค์–ด ์ „ ๊ตญ๋ฏผ์˜ ์†Œ๋“์„ estimate ํ•˜๊ณ  ์‹ถ๋‹ค๊ณ  ํ•ด๋ณด๊ฒ ์Šต๋‹ˆ๋‹ค. ์ค‘์œ„ ์†Œ๋“์„ ๊ตฌํ•ด์„œ ์ด๋ฅผ ์ง€ํ‘œ๋กœ ์‚ฌ์šฉํ•˜๋ ค๊ณ  ํ•œ๋‹ค๊ณ  ํ•˜๊ฒ ์Šต๋‹ˆ๋‹ค. 1. If you have a census: Just calculate the parameter from the census, and you're done. Population ๋ฐ์ดํ„ฐ๊ฐ€ ์ค€๋น„ ์™„๋ฃŒ ๋˜์—ˆ๋‹ค๋ฉด, ๋ฐ”๋กœ ๊ณ„์‚ฐ๋งŒ ํ•˜๋ฉด ๋ฉ๋‹ˆ๋‹ค. ํ•˜์ง€๋งŒ, ์ด๋Ÿฐ ๊ฒฝ์šฐ๊ฐ€ ๋‹น์—ฐํžˆ ํ”ํ•˜์ง€ ์•Š๊ฒ ์ฃ ?  2. If you don't have a census: Take a random sample from the population. Usa a statistic as..
Chan Lee
Chan Code & DS ๐Ÿง‘‍๐Ÿ’ป๐Ÿ“Š