College of Science

Statistics

www.stat.vt.edu

E. P. Smith, Head
Professors: J. B. Birch; I. Hoeschele; J.P. Morgan; M. R. Reynolds, Jr.1;
E. P. Smith; G.G. Vining; W.H. Woodall
Associate Professors: G. I. Holtzman; G. R. Terrell
Assistant Professors: P. Du; X. Deng; F. Guo; Y. Hong; L. House; D. Kim; I. Kim; S. Leman
Assistant Research Professors: C. Franck; J. Li; E. Vance
Visiting Assistant Professor: A. Ryan
Instructors: L. M. Williams; M. Lemons

1Joint with Forestry

Stat Club hike

Overview

    Statistics courses are offered at both the undergraduate and the graduate levels for students preparing for professions in statistics, for students who need statistical tools to engage in scientific research, and for students who want to acquire knowledge of the important concepts of probability and statistical inference.

    Statistics courses for graduate students and programs leading to the M.S. and Ph.D. degrees in statistics are described in the Graduate Catalog and in a special bulletin available from the department.

Bachelor of Science in Statistics

    The Statistics major requires the completion of 38 credit hours in Statistics, 16 hours of Mathematics, 3 hours of Computer Science and 3 hours of Technical Writing. Specific requirements can be found on the department website.

    All statistics majors are required to own specified personal computers and software. Consult the department for details.

    A special brochure describing the department and the B.S. program, intended for prospective entering freshmen, is available from the department upon request.

    Cooperative Education and Internship positions are available in industry and government, offering valuable practical experience. Students participating in such an experience can receive academic credit which will count towards graduation requirements.

Minor in Statistics

    Requirements: 18 hours

    1. Completion of an introductory sequence: 3005-3006, 3615-3616 or 4705-4706; Note that 3604 may substitute for 3615 if taken before the minor was declared.
    2. 4204 or 4214; Note that 4214 can only count for part (2) or (3), not both.
    3. 6-9 credit hours from 3504, 4004, 4214, 4444, 4504, 4514, 4524, 4534, 4804, Math 4454, or ISE 4404; Note that Econ majors or minors may use Econ 4304 to substitute for Stat 4804.
    4. 3 hours for part (3) may come from 2004, providing it is the first statistics course taken.

    The department reserves the right to withhold credit if a student takes a course, the content of which is partially duplicated in a course already taken (see Course Duplications below).

Laboratory for Interdisciplinary Statistical Analysis (LISA)

    Associated with the statistics department, the Laboratory for Interdisciplinary Statistical Analysis (LISA) provides statistical assistance for research projects throughout the university community. Faculty members, staff, and students are available to aid in statistical design and analysis for any authorized research study here at the university and at other state agencies.

Satisfactory Progress

    University policy requires that students who are making satisfactory progress toward a degree meet minimum criteria toward the Curriculum for Liberal Education (see "Academics"), toward the College of Science Core (see first part of this chapter), and toward the degree in statistics. Satisfactory progress toward the B.S. in Statistics requires that:

    Upon having attempted 70 semester credits (including transfer, advanced placement, advanced standing, credit by examination, freshman rule), students must have completed with a course grade of C- or above:

    1. Upon having attempted 72 semester credits (including transfer, AP, advanced standing, credit by examination, course withdrawal) majors must have completed the following courses with grades of C- or better: STAT 3005, 3006; MATH 1205, 1206, 1114, 1224, 2224, and not have taken any MATH or STAT designated course for the degree more than twice, including attempts ending in course withdrawal.
    2. Upon having attempted 90 semester credits, students must have an in-major grade point average of 2.0 or above.

Course Duplications

    No credit will be given for 2004 if taken with or after any other statistics course; MASC 1034, STAT 3604 if taken with or after any statistics course except 2004, 3104. BIT 2405 may not be used as a substitute for credit as a statistics course unless the student was officially registered as a Business major at the time BIT 2405 was taken.

Computer Literacy

    Many statistics courses involve the use of statistics software, primarily MINITAB, SAS, JMP or R. Experience with the software is not expected, but students should have familiarity with either the Windows or Macintosh operating system and have access to a computer.

Course Projects

    Many of the upper-division courses include a project, generally to be completed in small groups. These projects are designed to give students the kind of insight and experience in realistic statistical practice that cannot be obtained in classroom lectures or short-term homework assignments.

Undergraduate Courses (STAT)

1004: THE FIRST YEAR EXPERIENCE IN LEARNING FROM DATA
Introduction to the field of statistics and aspects of college life for first year students. Topics included: history of the statistics; key roles of statisticians in field, such as actuarial sciences, pharmaceutical, medical, and bioinformatics industries, governmental agencies, academia; fundamental principles of statistical fields of study and applications; exploring data sets; and aspects of college life for first-year students. (2H,2C)

2004: INTRODUCTORY STATISTICS
Fundamental concepts and methods of statistics with emphasis on interpretation of statistical arguments. An introduction to design of experiments, data analysis, correlation and regression, concepts of probability theory, sampling errors, confidence intervals, and hypothesis tests.  (See also Course Duplications). Pre: MATH 1015 or MATH 1014. (4H,3C)

2524: DATA SCIENCE
Organize, summarize, and visualize large-scale datasets from web studies. Interpret visualizations and communicate information discovered by data explorations. Program in R or comparable statistics programming language. Not intended for statistics majors. Pre: (2004, 3005 or 3615), (MATH 1526, MATH 1114, MATH 1206 or MATH 2015), (CS 1044, CS 1054 or CS 1064).
(3H,3C)

2964: FIELD STUDY
Pass/Fail only. Variable credit course.

2974H: INDEPENDENT STUDY
Honors section.
Variable credit course.

2984: SPECIAL STUDY
Variable credit course.

3005-3006: STATISTICAL METHODS
3005: Basic statistical methodology: exploratory data techniques, estimation, inference, comparative analysis by parametric, nonparametric, and robust procedures. Analysis of variance (one-way), multiple comparisons, and categorical data.  3006: Analysis of variance, simple and multiple, linear and nonlinear regression, analysis of covariance.  Use of MINITAB. STAT 3005 duplicates STAT 3615 and STAT 4604, only one may be taken for credit. STAT 3006 duplicates STAT 3616, STAT 4604 and STAT 4706, only one may be taken for credit. Pre: MATH 1206 or MATH 1226 for 3005; 3005 for 3006. (3H,3C)

3094: SAS PROGRAMMING
Introduction to basic programming techniques: creating DATA and PROC statements, libraries, functions, programming syntax and formats.  Other topics include loops, SAS Macros and PROC IML.  Emphasis is placed on using these tools for statistical analyses.  The pre-requisite may be substituted for an equivalent course. Pre: 3005. (3H,3C)

3104: PROBABILITY AND DISTRIBUTIONS
Probability theory, including set theoretic and combinatorial concepts; in-depth treatment of discrete random variables and distributions, with some introduction to continuous random variables; introduction to estimation and hypothesis testing. Pre: (MATH 1206 or MATH 1226 or MATH 2015 or MATH 1026 or MATH 1526), (STAT 3005 or STAT 3615). (3H,3C)

3424: INTRODUCTION TO STATISTICAL NEUROSCIENCE AND IMAGE ANALYSIS
Analysis of data arising in studies in neuroscience and from fMRI neuroimaging. Topics include background on neuroscience and the brain, overview of structural and functional MRI data, introduction to MATLAB, overview of linear models, contrasts, and statistical parametric mapping, experimental design, and Bayesian analysis. Pre: 3006 or 3616. (3H,3C)

3504: NONPARAMETRIC STATISTICS
Statistical methodology based on ranks, empirical distributions, and runs.  One and two sample tests, ANOVA, correlation, goodness of fit, and rank regression, R-estimates and confidence intervals.  Comparisons with classical parametric methods.  Emphasis on assumptions and interpretation. Pre: 3006, 4106, 4604, 4706. (3H,3C)

3604: STATISTICS FOR THE SOCIAL SCIENCES
Statistical methods for nominal, ordinal, and interval levels of measurement.  Topics include descriptive statistics, elements of probability, discrete and continuous distributions, one and two sample tests, measures of association.  Emphasis on comparison of methods and interpretations at different measurement levels. (See also Course Duplications). Pre: MATH 1015 or MATH 1014. (3H,3C)

3615-3616: BIOLOGICAL STATISTICS
Descriptive and inferential statistics in a biological context.  3615: Fundamental principles, one- and two-sample parametric inference, simple linear regression, frequency data.  3616: One- and two-way ANOVA, multiple regression, correlation, nonparametrics, using the MINITAB computer package. STAT 3615 partially duplicates STAT 3005 and STAT 4604, only one may be taken for credit. STAT 3616 partially duplicate STAT 3006, 4604 and 4706, only one may be taken for credit. (3H,3C)

3654 (CMDA 3654) (CS 3654): INTRODUCTORY DATA ANALYTICS & VISUALIZATION
Basic principles and techniques in data analytics; methods for the collection of, storing, accessing, and manipulating standard-size and large datasets; data visualization; and identifying sources of bias.
Pre: CMDA 2006. (3H,3C)

3704: STATISTICS FOR ENGINEERING APPLICATIONS
Introduction to statistical methodology with emphasis on engineering experimentation: probability distributions, estimation, hypothesis testing, regression, and analysis of variance.  Only one of the courses 3704, 4604, 4705, and 4714 may be taken for credit. Pre: MATH 2224 or MATH 2204 or MATH 2204H. (2H,2C)

4004: METHODS OF STATISTICAL COMPUTING
Computationally intensive computer methods used in statistical analyses. Statistical univariate and multivariate graphics; resampling methods including bootstrap estimation and hypothesis testing and simulations; classification and regression trees; scatterplot smoothing and splines.
Pre: (4105, 4214). (4H,3C)

4024: COMMUNICATION IN STATISTICAL COLLABORATIONS
Theory and examples of effective communication in the context of statistical collaborations. Practice developing the communication skills necessary to be effective statisticians using peer feedback and self-reflection. Topics include helping scientists answer their research questions, writing about and presenting statistical concepts to a non-statistical audience, and managing an effective statistical collaboration meeting. Pre-requisite: Senior standing in the Department of Statistics
Pre: 4105, 4204. Co: 4214. (3H,3C)

4094: INTRODUCTION TO PROGRAMMING IN R
Introduction to R programming techniques with an emphasis on statistical analyses. Topics include: data objects, loops, importing/exporting datasets, graphics, functions, t-tests, ANOVA, linear regression, nonparametric tests, and logistic regression. Pre: 3615. (1H,1C)

4105-4106: THEORETICAL STATISTICS
4105: Probability theory, counting techniques, conditional probability; random variables, moments; moment generating functions; multivariate distributions; transformations of random variables; order statistics.  4106: Convergence of sequences of random variables; central limit theorem; methods of estimation; hypothesis testing; linear models; analysis of variance. STAT 4105 partially duplicates STAT 4705, STAT 4714, and STAT 4724, only one may be taken for credit. Pre: MATH 2224 for 4105; 4105 for 4106. (3H,3C)

4204: EXPERIMENTAL DESIGNS
Fundamental principles of designing and analyzing experiments with application to problems in various subject matter areas.  Discussion of completely randomized, randomized complete block, and Latin square designs, analysis of covariance, split--plot designs, factorial and fractional designs, incomplete block designs. Pre: 3006 or 3616 or 4106 or 4706 or 5605 or 5615. (3H,3C)

4214: METHODS OF REGRESSION ANALYSIS
Multiple regression including variable selection procedures; detection and effects of multicollinearity; identification and effects of influential observations; residual analysis; use of transformations.  Non-linear regression, the use of indicator variables, and logistic regression.  Use of SAS. Pre: 3006 or 3616 or 4106 or 4706 or 5606 or 5616. (3H,3C)

4364: INTRODUCTION TO STATISTICAL GENOMICS
Statistical methods for bioinformatics and genetic studies, with an emphasis on statistical analysis, assumptions, and problem-solving.  Topics include: commonly used statistical methods for gene identification, association mapping and other related problems.  Focus on statistical tools for gene expression studies and association studies, multiple comparison procedures, likelihood inference and preparation for advanced study in the areas of bioinformatics and statistical genetics. Pre: (3006, MATH 1206, CS 1044) or (STAT 3006, MATH 1206, CS 1054) or (STAT 3006, MATH 1206, CS 1114). (3H,3C)

4444: APPLIED BAYESIAN STATISTICS
Introduction to Bayesian methodology with emphasis on applied statistical problems: data displaying, prior distribution elicitation, posterior analysis, models for proportions, means and regression. Pre: MATH 2224, (STAT 3104 or STAT 4105 or STAT 4705), (STAT 3006 or STAT 3616 or STAT 4706). (3H,3C)

4504: APPLIED MULTIVARIATE ANALYSIS
Non-mathematical study of multivariate analysis. Multivariate analogs of univariate test and estimation procedures.  Simultaneous inference procedures. Multivariate analysis of variance, repeated measures, inference for dispersion and association parameters, principal components analysis, discriminate analysis, cluster analysis.  Use of SAS. Pre: 3006 or 4706 or 5606 or 5616. (3H,3C)

4514: CONTINGENCY TABLE ANALYSIS
Statistical techniques for frequency data.  Goodness-of-fit. Tests and measures of association for two-way tables. Log-linear models for multidimensional tables.  Parameter estimation, model selection, incomplete tables, ordinal categories, logistic regression.  Use of SAS and SPSSx. Pre: 3006 or 3616 or 4106 or 4706 or 5606 or 5616. (3H,3C)

4524: SAMPLE SURVEY METHODS
Statistical methods for the design and analysis of survey sampling.  Fundamental survey designs.  Methods of randomization specific to various survey designs. Estimation of population means, proportions, totals, variances, and mean squared errors. Design of questionnaires and organization of a survey. Pre: 3006 or 3616 or 4106 or 4706 or 5606 or 5616. (3H,3C)

4534: APPLIED STATISTICAL TIME SERIES ANALYSIS
Applied course in time series analysis methods. Topics include regression analysis, detecting and address autocorrelation, modeling seasonal or cyclical trends, creating stationary time series, smoothing techniques, forecasting and forecast errors, and fitting autoregressive integrated moving average models. Pre: 3006 or 4104 or 4706 or 4714 or 3616. (3H,3C)

4584 (MATH 4584): ADVANCED CALCULUS FOR STATISTICS
Introduction to those topics in advanced calculus and linear algebra needed by statistics majors.  Infinite sequences and series.  Orthogonal matrices, projections, quadratic forms.  Extrema of functions of several variables. Multiple integrals, including convolution and nonlinear coordinate changes. Pre: (MATH 1114 or MATH 2114 or MATH 2114H), (MATH 1205 or MATH 1225), (MATH 1206 or MATH 1226), (MATH 2224 or MATH 2204 or MATH 2204H). (3H,3C)

4604: STATISTICAL METHODS FOR ENGINEERS
Introduction to statistical methodology with emphasis on engineering applications: probability distributions, estimation, hypothesis testing, regression, analysis of variance, quality control.  Only one of the courses 4604, 4705, and 4714 may be taken for credit. STAT 4604 partially duplicates STAT 3005, STAT 3615, STAT 3006, STAT 3616 and STAT 4706.  Only one may be taken for credit. Pre: MATH 1206 or MATH 1226. (3H,3C)

4654 (CMDA 4654) (CS 4654): INTERMEDIATE DATA ANALYTICS AND MACHINE LEARNING
A technical analytics course. Covers supervised and unsupervised learning strategies, including regression, generalized linear models, regularization, dimension reduction methods, tree-based methods for classification, and clustering. Upper-level analytical methods shown in practice: e.g, advanced naive Bayes and neural networks: Pre: 2006; or equivalent. Pre: CMDA 2006. (3H,3C)

4664 (CMDA 4664): COMPUTATIONAL INTENSIVE STOCHASTIC MODLEING
Stochastic modeling methods with an emphasis in computing are taught.  Select concepts from the classical and Bayesian paradigms are explored to provide multiple perspectives for how to learn from complex, datasets.  There is particular focus on nested, spatial, and time series models.
Pre: CMDA 2006. (3H,3C)

4705-4706: PROBABILITY AND STATISTICS FOR ENGINEERS
Basic concepts of probability and statistics with emphasis on engineering applications.  4705: Probability, random variables, sampling distributions, estimation, hypothesis testing, simple linear regression correlation, one-way analysis of variance.  4706: Multiple regression, analysis of variance, factorial and fractional experiments.  Only one of the courses 3704, 4604, 4705, 4714, and 4724 may be taken for credit.
Pre: MATH 2224 or MATH 2204 or MATH 2204H for 4705; 4705 or 4105 for 4706. (3H,3C)

4714: PROBABILITY AND STATISTICS FOR ELECTRICAL ENGINEERS
Introduction to the concepts of probability, random variables, estimation, hypothesis testing, regression, and analysis of variance with emphasis on application in electrical engineering.  Only one of the courses 3704, 4604, 4705, 4714 and 4724 may be taken for credit. Pre: MATH 2224 or MATH 2204 or MATH 2204H. (3H,3C)

4804 (AAEC 4804): ELEMENTARY ECONOMETRICS
Economic applications of mathematical and statistical techniques: regression, estimators, hypothesis testing, lagged variables, discrete variables, violations of assumptions, simultaneous equations. Pre: (3005 or 3604), (AAEC 1006). (3H,3C) II.

4964: FIELD STUDY
Pass/Fail only. Variable credit course.

4974: INDEPENDENT STUDY
Variable credit course.

4974H: INDEPENDENT STUDY
Honors section. Variable credit course.

4984: SPECIAL STUDY
Variable credit course.

4994: UNDERGRADUATE RESEARCH
Variable credit course.

4994H: UNDERGRADUATE RESEARCH
Honors section. Variable credit course.



TOP