### DATA SCIENCE CORE

*Take the following courses:*

**CS-110 ** Computer Science I

An introductory study of computer science software development concepts. Python is used to introduce a disciplined approach to problem solving methods, algorithm development, software design, coding, debugging, testing, and documentation in the object oriented paradigm. This is the first course in the study of computer science.

3 CreditsN,CTGES,CTGIS**Recommended programming experience or IT110 or IT100, IT111 or IM110 or MA103 but
not necessary. **

**DS-110 ** Intro to Data Science

This course introduces the student to the emerging field of data science through the presentation of basic math and statistics principles, an introduction to the computer tools and software commonly used to perform the data analytics, and a general overview of the machine learning techniques commonly applied to datasets for knowledge discovery. The students will identify a dataset for a final project that will require them to perform preparation, cleaning, simple visualization and analysis of the data with such tools as Excel and R. Understanding the varied nature of data, their acquisition and preliminary analysis provides the requisite skills to succeed in further study and application of the data science field. Prerequisite: comfort with pre-calculus topics and use of computers.

3 CreditsN** **

**MA-130** Calculus I

An introduction to calculus including differentiation and integration of elementary functions of a single variable, limits, tangents, rates of change, maxima and minima, area, volume, and other applications. Integrates the use of computer algebra systems, and graphical, algebraic and numerical thinking.

4 CreditsN, QM

**MA-116** Discrete Structures

Introduces mathematical structures and concepts such as functions, relations, logic, induction, counting, and graph theory. Their application to Computer Science is emphasized.

4 CreditsN, Q**Pre-requisite high school algebra.**

**MA-160** Linear Algebra

An introduction to systems of linear equations, matrices, determinants, vector spaces, linear transformations, eigenvalues, and applications.

3 CreditsN, QM**Prerequisites: MA130.**

### STATISTICS CORE

*Take one of the following courses:*

*
*

**MA-220** Introduction to Probability & Statistics

An introduction to the basic ideas and techniques of probability theory and to selected topics in statistics, such as sampling theory, confidence intervals, and linear regression.

4 CreditsN, QS, CTGES**Prerequisite: MA130**

*
*

**MA-205** Elementary Statistics

Introduction to traditional statistical concepts including descriptive statistics, binomial and normal probability models, confidence intervals, tests of hypotheses, linear correlation and regression, two-way contingency tables, and one-way analysis of variance.

4 CreditsN, QS, WK-SP**Prerequisite: FYC-101 or EN-110 or EN-109**

**EB-211 ** Business Statistics

This course covers basic descriptive and inferential statistics, normal curve and z-score computations, and addresses hypothesis testing using Chi-Square, T-Test, ANOVA, and linear regression modelling.

3 Credits QS,S

**BI-305** Biostatistics

This course deals centrally with quantitative and statistical methodology in the biological sciences. It includes experimental design and the conventions of generating, analyzing, interpreting and presenting biological data. Counts as a math course for graduate and professional school requirements.

4 CreditsN, QS, CTGES**Prerequisites: BI106 or ESS100**

**ESS-230** Environmetrics

This course is a survey of the various visual, statistical, and modeling approaches commonly used in the analysis of environmental data. The course covers: (1) visual literacy from exploratory data inquisition to poster creation; (2) elementary group comparison such as t-test and ANOVA and their non-parametric analogs;(3) basic systems modeling; and (4) regression modeling techniques based on the generalized linear model framework.

3 CreditsN, QS, CTGES, CTGIS**Prerequisites: Sophomore standing and permission of the instructor.**

**ESS-309** Econometrics

A first course in econometrics with forays into regression, optimization, and modeling.

2 CreditsN, Q**Prerequisites: Introductory economics course.**

**PY-361** Research Methods & Stats Psychology II

This course focuses on becoming a better research producer and a research consumer from a psychological science perspective. Students will learn to think critically about media claims and accurately summarize primary source articles about behavior. Students will learn to use statistical software to accurately describe data. Students will learn to communicate effectively about research through written and oral work and make ethical judgments informed by APA ethical standards. Students will design and execute their own individual research studies.

4 CreditsS, CW, QS** **

**SW-215 ** Integrated Research Methods & Stats II

The second part of an integrated course sequence applying the scientific process to the fields of Social Work and Sociology, emphasizing key research concepts, commonly used quantitative and qualitative methods, and the ability to communicate effectively about research with written and verbal skills. The course teaches students not only to conduct research but also to consume and utilize research.

3 CreditsS** **

### SECOND-LEVEL

*Take the following courses:*

*
*

**CS-370 ** Database Management Systems

Focuses on concepts and structures necessary to design and implement a database management system. Various modern data models, data security and integrity, and concurrency are discussed. An SQL database system is designed and implemented as a group project.

3 CreditsN,CTGIS**Prerequisites: CS110. **

*
*

**DS-210 ** Data Acquisition

Students will understand how to access various data types and sources, from flat file formats to databases to big storage data architecture. Students will perform transformations, cleaning, and merging of datasets in preparation for data mining and analysis.

3 CreditsN**PRE-REQ: CS 110 and DS 110. **

**DS-352 ** Machine Learning

This course considers the use of machine learning (ML) and data mining (DM) algorithms for the data scientist to discover information embedded in datasets from the simple tables through complex and big data sets. Topics include ML and DM techniques such as classification, clustering, predictive and statistical modeling using tools such as R, Matlab, Weka and others. Simple visualization and data exploration will be covered in support of the DM. Software techniques implemented the emerging storage and hardware structures are introduced for handling big data.

3 CreditsN**Prerequisite: CS-110, DS-110, and an approved statistics course from this list: MA-205,
MA-220, BI-305, PY-214, or EB- 211. **

**IM-242 ** Info Visualization

This course considers the various aspects of presenting digital information for public consumption visually. Data formats from binary, text, various file types, to relational databases and web sites are covered to understand the framework of information retrieval for use in visualization tools. Visualization and graphical analyses of data are considered in the context of the human visual system for appropriate information presentation. Various open-source and commercial digital tools are considered for development of visualization projects.

3 CreditsN,CTDH,CTGES**Prerequisite: IT 110, IT 111, IM 110, DS 110, or CS 110 or permission. **

**MA-321** Multivariate Statistics

A class in multivariate statistical techniques including non-parametric methods, multiple regression, logistic regression, multiple testing, principle analysis.

3 CreditsN, QS**Prerequisites: An introductory statistics course ( MA220 or BI305 or PY214 or EB211)
and linear algebra (MA 160) or Calculus 1 (MA 130).**

**DS-375 ** Big Data

This course considers the management and processing of large data sets, structured, semi-structured, and unstructured. The course focuses on modern, big data platforms such as Hadoop and NoSQL frameworks. Students will gain experience using a variety of programming tools and paradigms for manipulating big data sets on local servers and cloud platforms.

3 CreditsN**Prerequisites: DS 110 Intro to Data Science and CS 370 Database Management Systems **

### CAPSTONE

*Take the following course:*

**MA-325** Statistical Consulting

The participating students will receive training during the semester in consulting on statistical problems and to assist in collaborative efforts with faculty and/or staff on client-partnered projects that are pre-determined. The semester-long project provides the student with both real work experience in the field of statistics and a project-based learning experience in partnership with the client. May be taken multiple times for credit.

3 CreditsN, QS, CW, SW-LE** Prerequisite: Take one of the following: BI-305 EB-211 ESS-230 ESS-309 MA-205 MA-220
PY-361 or SW-215. Also take FYC-101 or EN-110 or EN-109.**

### COGNATE AREA

*Take 12 credits, 3 of which must be at the 300 level or higher. Cognate area should
be a coherent set of courses outside the areas of Data Science, Math and Computer
Science. *

#### What should you expect?

Students in the data science program will be prepared for jobs dealing with data in whatever fields they are interested. With an emphasis on practical skills for the organization, analysis, visualization, and presentation of actionable information gathered from widely varied data sources, data science will work with students on real world data. Students will take a variety of courses in data science, computer science, statistics, and in a cognate area of their choice.

As part of the POE in data science you can participate in internships at locations such as Mutual Benefit Corporation or Juniata’s Office of Advancement.

**What your four years in the Data Science Program at Juniata College might look like:**

##### First Year

Take Introduction to Data Science (DS 110), Discrete Structures (MA 116), Computer Science 1 (CS 110), and Calculus (MA 130). Begin exploring other fields such as business, biology, environmental science, psychology, or history as a possible area to apply your data analysis skills, a cognate area.

##### Sophomore Year

Take Data Acquisition (DS 210), Linear Algebra (MA 160), and Introduction to Probability and Statistics (MA 220). Start taking courses in chosen cognate area.

##### Junior Year

Take upper level courses in data science, computer science, and statistics. Continue taking cognate area courses. Consider studying abroad at the Mathematical Sciences Semesters at Guanjuato, Mexico. Look into internships Participate in DataFest.

##### Senior Year

Take Data Science Consulting (DS 325) to have capstone in Data Science of a real life
data analysis project. Continue taking upper levels and finish your cognate area courses.

Complete an internship. Participate in Data Fest.

#### POE Credit Total = 54

Students must complete at least 18 credits at the 300/400-level. Any course exception must be approved by the advisor and/or department chair.