DATA SCIENCE CORE
Take the following courses:
DS-110 Intro to Data Science
This course introduces the student to the emerging field of data science through the presentation of basic math and statistics principles, an introduction to the computer tools and software commonly used to perform the data analytics, and a general overview of the machine learning techniques commonly applied to datasets for knowledge discovery. The students will identify a dataset for a final project that will require them to perform preparation, cleaning, simple visualization and analysis of the data with such tools as Excel and R. Understanding the varied nature of data, their acquisition and preliminary analysis provides the requisite skills to succeed in further study and application of the data science field. Prerequisite: comfort with pre-calculus topics and use of computers.
3 CreditsN
CS-110 Computer Science I
An introductory study of computer science software development concepts. Python is used to introduce a disciplined approach to problem solving methods, algorithm development, software design, coding, debugging, testing, and documentation in the object oriented paradigm. This is the first course in the study of computer science.
3 CreditsN,CTGES,CTGISRecommended programming experience or IT110 or IT100, IT111 or IM110 or MA103 but not necessary.
STATISTICS
Take one of the following courses:
MA-220 Introduction to Probability & Statistics
An introduction to the basic ideas and techniques of probability theory and to selected topics in statistics, such as sampling theory, confidence intervals, and linear regression.
4 CreditsN, QS, CTGESPrerequisite: MA130
MA-205 Elementary Statistics
Introduction to traditional statistical concepts including descriptive statistics, binomial and normal probability models, confidence intervals, tests of hypotheses, linear correlation and regression, two-way contingency tables, and one-way analysis of variance.
4 CreditsN, QS, WK-SPPrerequisite: FYC-101 or EN-110 or EN-109
EB-211 Business Statistics
This course covers basic descriptive and inferential statistics, normal curve and z-score computations, and addresses hypothesis testing using Chi-Square, T-Test, ANOVA, and linear regression modelling.
3 Credits QS,S
ESS-230 Environmetrics
This course is a survey of the various visual, statistical, and modeling approaches commonly used in the analysis of environmental data. The course covers: (1) visual literacy from exploratory data inquisition to poster creation; (2) elementary group comparison such as t-test and ANOVA and their non-parametric analogs;(3) basic systems modeling; and (4) regression modeling techniques based on the generalized linear model framework.
3 CreditsN, QS, CTGES, CTGISPrerequisites: Sophomore standing and permission of the instructor.
SW-215 Integrated Research Methods & Stats II
The second part of an integrated course sequence applying the scientific process to the fields of Social Work and Sociology, emphasizing key research concepts, commonly used quantitative and qualitative methods, and the ability to communicate effectively about research with written and verbal skills. The course teaches students not only to conduct research but also to consume and utilize research.
3 CreditsS
BI-305 Biostatistics
This course deals centrally with quantitative and statistical methodology in the biological sciences. It includes experimental design and the conventions of generating, analyzing, interpreting and presenting biological data. Counts as a math course for graduate and professional school requirements.
4 CreditsN, QS, CTGESPrerequisites: BI106 or ESS100
ESS-309 Econometrics
A first course in econometrics with forays into regression, optimization, and modeling.
2 CreditsN, QPrerequisites: Introductory economics course.
PY-361 Research Methods & Stats Psychology II
This course focuses on becoming a better research producer and a research consumer from a psychological science perspective. Students will learn to think critically about media claims and accurately summarize primary source articles about behavior. Students will learn to use statistical software to accurately describe data. Students will learn to communicate effectively about research through written and oral work and make ethical judgments informed by APA ethical standards. Students will design and execute their own individual research studies.
4 CreditsS, CW, QS
ADDITIONAL COURSES
Take twelve credits from the following courses:
CS-370 Database Management Systems
Focuses on concepts and structures necessary to design and implement a database management
system. Various modern data models, data security and integrity, and concurrency are
discussed. An SQL database system is designed and implemented as a group project.
3 CreditsN,CTGISPrerequisites: CS110.
IM-242 Info Visualization
This course considers the various aspects of presenting digital information for public
consumption visually. Data formats from binary, text, various file types, to relational
databases and web sites are covered to understand the framework of information retrieval
for use in visualization tools. Visualization and graphical analyses of data are considered
in the context of the human visual system for appropriate information presentation.
Various open-source and commercial digital tools are considered for development of
visualization projects.
3 CreditsN,CTDH,CTGESPrerequisite: IT 110, IT 111, IM 110, DS 110, or CS 110 or permission.
DS-210 Data Acquisition
Students will understand how to access various data types and sources, from flat file
formats to databases to big storage data architecture. Students will perform transformations,
cleaning, and merging of datasets in preparation for data mining and analysis.
3 CreditsNPRE-REQ: CS 110 and DS 110.
DS-352 Machine Learning
This course considers the use of machine learning (ML) and data mining (DM) algorithms
for the data scientist to discover information embedded in datasets from the simple
tables through complex and big data sets. Topics include ML and DM techniques such
as classification, clustering, predictive and statistical modeling using tools such
as R, Matlab, Weka and others. Simple visualization and data exploration will be covered
in support of the DM. Software techniques implemented the emerging storage and hardware
structures are introduced for handling big data.
3 CreditsNPrerequisite: CS-110, DS-110, and an approved statistics course from this list: MA-205,
MA-220, BI-305, PY-214, or EB- 211.
DS-375 Big Data
This course considers the management and processing of large data sets, structured,
semi-structured, and unstructured. The course focuses on modern, big data platforms
such as Hadoop and NoSQL frameworks. Students will gain experience using a variety
of programming tools and paradigms for manipulating big data sets on local servers
and cloud platforms.
3 CreditsNPrerequisites: DS 110 Intro to Data Science and CS 370 Database Management Systems
MA-321 Multivariate Statistics
A class in multivariate statistical techniques including non-parametric methods, multiple
regression, logistic regression, multiple testing, principle analysis.
3 CreditsN, QSPrerequisites: An introductory statistics course ( MA220 or BI305 or PY214 or EB211)
and linear algebra (MA 160) or Calculus 1 (MA 130).
MA-325 Statistical Consulting
The participating students will receive training during the semester in consulting
on statistical problems and to assist in collaborative efforts with faculty and/or
staff on client-partnered projects that are pre-determined. The semester-long project
provides the student with both real work experience in the field of statistics and
a project-based learning experience in partnership with the client. May be taken multiple
times for credit.
3 CreditsN, QS, CW, SW-LE Prerequisite: Take one of the following: BI-305 EB-211 ESS-230 ESS-309 MA-205 MA-220
PY-361 or SW-215. Also take FYC-101 or EN-110 or EN-109.
CS-370 Database Management Systems
Focuses on concepts and structures necessary to design and implement a database management system. Various modern data models, data security and integrity, and concurrency are discussed. An SQL database system is designed and implemented as a group project.
3 CreditsN,CTGISPrerequisites: CS110.
IM-242 Info Visualization
This course considers the various aspects of presenting digital information for public consumption visually. Data formats from binary, text, various file types, to relational databases and web sites are covered to understand the framework of information retrieval for use in visualization tools. Visualization and graphical analyses of data are considered in the context of the human visual system for appropriate information presentation. Various open-source and commercial digital tools are considered for development of visualization projects.
3 CreditsN,CTDH,CTGESPrerequisite: IT 110, IT 111, IM 110, DS 110, or CS 110 or permission.
DS-210 Data Acquisition
Students will understand how to access various data types and sources, from flat file formats to databases to big storage data architecture. Students will perform transformations, cleaning, and merging of datasets in preparation for data mining and analysis.
3 CreditsNPRE-REQ: CS 110 and DS 110.
DS-352 Machine Learning
This course considers the use of machine learning (ML) and data mining (DM) algorithms for the data scientist to discover information embedded in datasets from the simple tables through complex and big data sets. Topics include ML and DM techniques such as classification, clustering, predictive and statistical modeling using tools such as R, Matlab, Weka and others. Simple visualization and data exploration will be covered in support of the DM. Software techniques implemented the emerging storage and hardware structures are introduced for handling big data.
3 CreditsNPrerequisite: CS-110, DS-110, and an approved statistics course from this list: MA-205, MA-220, BI-305, PY-214, or EB- 211.
DS-375 Big Data
This course considers the management and processing of large data sets, structured, semi-structured, and unstructured. The course focuses on modern, big data platforms such as Hadoop and NoSQL frameworks. Students will gain experience using a variety of programming tools and paradigms for manipulating big data sets on local servers and cloud platforms.
3 CreditsNPrerequisites: DS 110 Intro to Data Science and CS 370 Database Management Systems
MA-321 Multivariate Statistics
A class in multivariate statistical techniques including non-parametric methods, multiple regression, logistic regression, multiple testing, principle analysis.
3 CreditsN, QSPrerequisites: An introductory statistics course ( MA220 or BI305 or PY214 or EB211) and linear algebra (MA 160) or Calculus 1 (MA 130).
MA-325 Statistical Consulting
The participating students will receive training during the semester in consulting on statistical problems and to assist in collaborative efforts with faculty and/or staff on client-partnered projects that are pre-determined. The semester-long project provides the student with both real work experience in the field of statistics and a project-based learning experience in partnership with the client. May be taken multiple times for credit.
3 CreditsN, QS, CW, SW-LE Prerequisite: Take one of the following: BI-305 EB-211 ESS-230 ESS-309 MA-205 MA-220 PY-361 or SW-215. Also take FYC-101 or EN-110 or EN-109.
Secondary Emphasis Credit Total = 18
Six credits must be at the 300/400-level. Any course exception must be approved by the advisor and/or department chair.