Introduction to Biostatistics Prof. Ho Kim (김호hosting03.snu.ac.kr/~hokim/int/2014/chap1.pdf ·...

Introduction to Biostatistics Prof. Ho Kim (김호) lecture Thur 10:00-12:00 6:30-8:30 lab Thur 1:00-3:00 8:30-10:30 통계학의 이해, 이승욱 편저, 자유아카데미 Biostatistics: A Foundation for Analysis in the Health Sciences, W.W. Daniel, Wiley slide files are available at http://hosting03.snu.ac.kr/~hokim/ -> 열린강의실 English ~~

• We will learn

– Basic concepts of statistics

– Application to public health researches

– Statistical software (SAS and R)

• Problems will be given at the end of every lecture

• Due on the next lecture (One week later)

• Home works (30)%, midterm exam (30%), final exam (30%), etc (10%)

• Office hour 9:00-10:00 5:00-6:00 on Thursday at 221-209 or 220-706

Statistical Software (SAS and R)

http://dss.princeton.edu/training/RStata.pdf

R을 이용해서 누구나하는 통계분석, 안재형, 한나래

case- sensitive

• 기술통계학 (descriptive statistics)

• 추측통계학 (inferential statistics)

• 통계학 (statistics):

1) 자료를 수집, 정리, 요약

to collect, organize, and summarize data

2) 자료의 일부만 관찰하여서 그 자료의 출처가 되는 전체 자료의 특성에 관한 추측

Observe only a portion, draw information on the whole

1.2 basic concepts

• 생통계학 (Biostatistics)

– 통계학은 다양한 분야에서 사용되고 있음 (경제학, 심리학, 공학,…등등)

– Statistics are used in many fields (economics, psychology, engineering, etc)

– 생명과학 분야의 자료가 가지는 특성

– Characteristics of biological data

– 특히 의생명, 보건학 자료가 가지는 특성

– Public health & biomedical data

– 미국의 예

– US examples

1.2 basic concepts

• 변수 (variables)

– 생통계분야 연구의 중요한 목적 중 하나는 인간의 건강에 영향을 미치는 요인과 그 이유를 설명하는 것이다.

– One of major purposes of statistics in public health is to find out the factors and explain how

– 만약 모든 사람들이 같은 값을 가지는 결과가 있다면 이와 관련이 있는 설명변수를 찾기는 불가능하다.

– If everyone has one value (constants, not variable), it is impossible to see different health outcomes by this factor

1.2 basic concepts

• 변수 (variables) – 이를 위해서 사람들마다 다른 값을 가지는 건강 변수와 위험

요인 변수들을 관찰한다.

• 양적변수 (quantitative variables): 숫자로 표시되는 경우

• 질적변수 (qualitative variables) : 수량화하기 어려운 경우

• 확률변수 (random variable): 측정한 변수가 우연성(확률)에 의해서 관찰되는 경우 (values of the variable are observed with certain probability rules) – 이산확률변수 (discrete random variable): 불연속적으로 관

찰되는 경우

– 연속확률변수 (continuous random variable): 연속적인 값으로 주어지는 경우

1.2 basic concepts

• 모집단 (population) – 연구의 궁극적 목표 (target of the study)

– 각각의 구성인자로 이루어진 집단 (composed with individual elements)

– 유한, 혹은 무한 (finite or infinite)

• 표본 (sample) – 모집단을 전수조사하는 것이 적당하지 않다면 표본을 뽑아

야 함

– We may investigate all elements of the population. Or we may select samples.

– 표본을 과학적으로 추출하는 것이 매우 중요함

– How to select samples is extremely important.

Representativeness(1)

Basic concepts of sampling survey

• 전수조사 혹은 총조사 (census) vs. 표본조사 (sampling survey) : 전수조사가 불가능하거나 혹은 더 정확하지 않을 수도 있음 (Census is not always more accurate.)

• 대상모집단(target population) and 추출모집단(sampling population): may be different (ex. Telephone survey)

• 표본오차 (sampling error): 일부만을 조사하기 때문에 발생 (happens because we observe sample, not the whole) always positive, statistical, probabilistic, theoretically derivable,

• 비표본오차(non-sampling error) 혹은 비표집오차: 그 이외의 오차

– 대상모집단과 추출모집단의 차이 (difference between target population and the sample)

– 설문지 결함 (Faults of the questionnaire)

– 무응답오차 (non-response)

– 기타 (조사원 불성실, 조사단위의 누락, 자료처리 과정에서의 오류 등), etc (interviewer’s error, deletion from the survey, data processing error)

Basic concepts of sampling survey

1.4 organizing data

보기(example)

Age of the

patients

Sort by the age

1.5 frequency table

• Sturge’s rule : k=1+3.222(log 10 n)

k=# of classes, n=# sample

width of the classes : w=R/k

R=range (max-min)

Example 1.5.1

• K=1+3.322(log57) =about 7

• R/k=(79-12)/7=9.6

= about 10

• Frequency table

• histogram

Population & sample

• population

• parameters 2( , )N

• sample

• estimates

1, , nY Y

S Y Yn

• parameter: constants which determine statistical properties of the assumed model

Y=a+b x 2( , )N

1 ( )exp

1.6 measuring central location

• statistic<- sample,

• parameter<-population

• Mean of the pop

(parameter)

• Mean of the sample

(statistics)

• 중위수는 평균과 달리 극단적인 값에 영향을 훨씬 덜 받는다. (median은 robust한 통계량이다) : 자료가 홀수개이면 가운데 값이고 짝수개이면 가운데 2개의 평균값이다.

• Median is less sensitive (robust) to the outliers:

• 최빈수 (mode)

1.6 measuring central location

1.7 Measuring variability

• 산포성: 범위, 분산(표준편차), 변이계수

• Variability: range, variance(standard deviation), CV (coefficient of variation)

• 범위 = 최대값-최소값, Range=Max-Min

• 모집단의 분산

• (Variance of the pop)

• 표본의 분산

• Variance of the sample

• 변이계수 CV=

• SD’s are similar

• CV 10/145*100=6.9 (%)

10/80*100=12.5 (%)

(100)s

1.7 Measuring variability

1.8 measuring central location for grouped data

2086.536.6

1.9 variance and SD fro grouped data

( )13347.37

238.345956

homework

• 연습문제 1.9.1 1.9.2 1.9.3

• 종합문제 22 23 28

• English translation file is also available at my web site.

Introduction to Biostatistics Prof. Ho Kim (김호hosting03.snu.ac.kr/~hokim/int/2014/chap1.pdf ·...

Documents

Nang luc sinh ke phu nu khmer ho kim thi

KIM, Nam-Ho; SANKAR, - Univates · 2020. 2. 13. · KIM, Nam-Ho; SANKAR, Bhavani V. 978-85-216-1959-8 Grupo GEN 27/11/2019 Remoção Fundações e Estruturas de Contenção BUDHU,

BM Tahun 3 (Unit 1 Akt Keluarga Kim Ho)

Principles of Biostatistics Chapter 17 Correlation

Biostatistics 602 - Statistical Inference Lecture 15 Bayes ... · Recap. . . . . . . Bayesian Statistics. . . . . . . Bayes Estimator. . . . . . Conjugate Family. Summary.. Biostatistics

Basics in Epidemiology & Biostatistics 1 RSS6 2014

Biostatistics Constantin Yiannoutsos

CPH Study Session - Biostatistics...CPH Study Session - Biostatistics Sumihiro (Sumi) Suzuki, PhD Associate Professor and Interim Chair Department of Biostatistics and Epidemiology

PUBH 8403: Research Skills in Biostatistics [10pt ...brad/8400/Wolfson_simulationstudy.pdf · PUBH 8403: Research Skills in Biostatistics Simulation Studies & Reproducibility Julian

biostatistics homework

Biostatistics Case Studies

Ho bieu chanh chua tau kim quy

Biostatistics - Normal Distribution

Biostat 2014 Frazee Biostatistics Kxt053

Prof Carrol Gamble Dept Biostatistics University of Liverpool

1 2 Preparation of Natural Seasoning using Enzymatic …—ˆ민수.pdf · 2015-02-16 · Jeong Gyun Kim, Yuni Noh, Kwon Hyun Park, Ji Sun Lee, Hyeon Jeong Kim, Min Ji Kim, Moo Ho

Introduction to Biostatistics Prof. Ho Kim (김호hosting03.snu.ac.kr/~hokim/int/2017/chap_0.pdf · 2017-03-06 · Introduction to Biostatistics Prof. Ho Kim (김호) lecture Mon

Strategies to fit pattern-mixture models - Biostatistics

Biostatistics 602 - Statistical Inference Lecture 06 Basu ... · Complete Statistics. . . . . . . . . Basu’s Theorem. Summary.. Biostatistics 602 - Statistical Inference Lecture

Review of Web 2.0 Services mobile Kim Min Jae. Hong Su Lyn. Kim Youn Ho. Yun Seong Geun