Skip to main content Skip to local navigation
Home » Undergraduate Programs » Data Science

Data Science

Change the world through data

Data Science is a booming field that uses computing and statistical reasoning to generate valuable insights from data. It has emerged as a key competence for any organization, including both businesses and governments. 

As a student in the Data Science program at York University (Toronto, Ontario), you will master the statistical methods, computation skills and data analysis techniques that enable data scientists to extract knowledge from data. In your studies, you will become familiar with the nature and needs of analyzing large and complex data through case studies in specific domains such as business, health, and digital media, as well as through a capstone experience that engages students in research with data in an industrial setting. 

Degrees Offered

Honours Major BA Program

Honours Major BSc Program

Recommended course enrollment schedules for Data Science students:

Year 1

Fall TermWinter Term
Math 1130 (Introduction to Data Science)Math 1131 (Introduction to Statistics I)
EECS 1015 (Computer Science and Programming in Python)EECS 1516 (Object Oriented Methods using Python)
Math 1013 (Applied Calculus I)Math 1014 (Applied Calculus II)
Math 1019 (Discrete Mathematics)Math 1025 (Applied Linear Algebra)
NATS (Gen Ed)NATS (Gen Ed)

Year 2

Fall TermWinter Term
Math 2030 (Elementary Probability)Math 2130 (Data Science)
Math 2015 (Applied Multivariable Calculus)Math 2131 (Introduction to Statistics II)
stream courseEECS 2101 / ITEC 2620 (Introduction to Data Structures)
WRIT 2202 (Effective Communication for Data Science)Humanities (Gen Ed)
Humanities (Gen Ed)Other (Gen Ed)

Year 3

Fall TermWinter Term
Math 3330 (Regression Analysis)Math 3333 (Data Analytics - Hands-on Approach)
ITEC 3221 (Database Management)ITEC 3310 (Data Visualization)
stream courseSocial Sciences (Gen Ed)
Other (Gen Ed)Social Sciences (Gen Ed)
electiveelective

Year 4

Fall TermWinter Term
Math 4036 (Statistical Machine Learning)Math 4931 (Simulation and the Monte Carlo Methods)
Math 4949 CapstoneMath 4949 Capstone
stream courseelective
PHIL 3500 (Ethics of Data Science)elective
electiveelective

At least 6 courses must be at 4000 level and at least 6 more at 3000 or 4000 level.

Electives might include EECS 3101 (algorithms), EECS 4415 (big data), EECS 4404 (machine learning),MATH 4130B (time series), MATH 4730 (experimental design), Math 4330 (categorical data), etc.

Year 1

Fall TermWinter Term
Math 1130 (Introduction to Data Science)Math 1131 (Introduction to Statistics I)
EECS 1015 (Computer Science and Programming in Python)EECS 1516 (Object Oriented Methods using Python)
Math 1013 (Applied Calculus I)Math 1014 (Applied Calculus II)
Math 1019 (Discrete Mathematics)Math 1025 (Applied Linear Algebra)
foundational sciencefoundational science

Year 2

Fall TermWinter Term
Math 2030 (Elementary Probability)Math 2130 (Data Science)
Math 2015 (Applied Multivariable Calculus)Math 2131 (Introduction to Statistics II)
stream courseEECS 2101 / ITEC 2620 (Introduction to Data Structures)
WRIT 2202 (Effective Communication for Data Science)science credit
science creditscience credit

Year 3

Fall TermWinter Term
Math 3330 (Regression Analysis)Math 3333 (Data Analytics - Hands-on Approach)
ITEC 3221 (Database Management)ITEC 3310 (Data Visualization)
stream coursescience credit
science creditnon-science credit
non-science creditelective

Year 4

Fall TermWinter Term
Math 4036 (Statistical Machine Learning)Math 4931 (Simulation and the Monte Carlo Methods)
Math 4949 CapstoneMath 4949 Capstone
stream courseelective
PHIL 3500 (Ethics of Data Science)non-science credit
electiveelective

At least 14 courses must be 3000 or 4000 level.

Electives might include EECS 3101 (algorithms), EECS 4415 (big data), EECS 4404 (machine learning),MATH 4130B (time series), MATH 4730 (experimental design), Math 4330 (categorical data), etc.

1st Year Fall Term (Sep to Dec)1st Year Winter Term (Jan to Apr)
MATH 1130
Introduction to Data Science
Mon, Wed, Fri 11:30 AM-12:30 PM
MATH 1131
Introduction to Statistics
Tue, Thu 10:00 AM-11:30 AM
EECS 1015
Computer Science and Programming in Python
Tue, Thu 2:30 PM-4:30 PM
EECS 1516
Object Oriented Methods using Python
Tue, Thu 2:30 PM-4:30 PM
MATH 1019
Discrete Mathematics
Mon, Wed, Fri 2:30 PM-3:30 PM
MATH 1025
Applied Linear Algebra
Mon, Wed, Fri 2:30 PM-3:30 PM
MATH 1013
Applied Calculus I
Tue, Thu 10:00 AM-11:30 AM
MATH 1014
Applied Calculus II
Mon, Wed, Fri 12:30 PM-1:30 PM
An additional course in Arts or ScienceAn additional course in Arts or Science

Streams

A unique aspect of our program, compared to other Data Science programs in Ontario, is our streams. You will be required to select a stream or specialization in a domain of application. Along with the required courses in statistics, computing and mathematics, you will also take courses in your selected stream, which could be any of the fields below.

woman holding a calculator and a pen

BUSINESS

Take courses from Schulich School of Business in marketing, finance, accounting, and management.

Business Stream Courses:

  • FINE 2000 (Introduction to Finance)
  • MKTG 1030 (Marketing Management)
  • MGMT 1000 (Introduction to Business)
  • ACTG 2010 (Financial Accounting)
  • OMIS 2010 (Operations and Supply Chain Management)
Healthcare worker texting on a phone

HEALTH

Take courses from the School of Health Policy and Management in health informatics, healthcare quality and decision making.

Health Stream Courses:

  • HLST 1010 (Found. Health Studies I)
  • HLST 1011 (Found. Health Studies II)
  • HLST 2040 (Health Informatics)
  • HLST 3350 (Health Data Analytics, Machine Learning and AI)
  • HLST 4310 (Analysis & Design of Health Information Systems) or HLST 4330 (Decision Making and Decision Support Systems in Healthcare)
Two students in a course from the School of the Arts, Media, Performance & Design

COMPUTATIONAL ARTS

Take courses from the School of the Arts, Media, Performance & Design in interactive digital media and video game development.

Computational Arts Stream Courses:

  • DATT 1010 (Interactive Digital Media I)
  • DATT 1020 (Interactive Digital Media II)
  • DATT 2300 (Game Design and Prototyping I)
  • DATT 2310 (Game Design and Prototyping II)
students crowded around a computer

OPTIMIZATION

Take courses on operations research in the Department of Mathematics & Statistics.

Optimization Stream Courses:

  • MATH 3171 (Linear Optimization)
  • MATH 3172 (Combinatorial Optimization)
  • MATH 4171 (Nonlinear Optimization)
  • MATH 4172 (Applied Decision Models)
laptop screen showing a data chart

COMPUTATION

Take courses on numerical and computational methods in the Department of Mathematics & Statistics.

Computation Stream Courses:

  • MATH 2041 (Symbolic Computation Lab)
  • MATH 3241 (Numerical Methods I)
  • MATH 3090 (Computational Math)

Woman presenting in front of a whiteboard

Capstone Course

The capstone experience further differentiates this program from others in Ontario by bringing together students’ statistical, computing, and stream knowledge to work on a real-life problem from industry or the public sector. The full-year capstone course, taken in the fourth year, will provide you with hands-on research experience before graduation. You will meet with professionals from companies and organizations to learn about their problems related to data science and work together on a solution, reporting back to your instructor regularly on your project progress.

This experience will give you valuable, real-world experience – an edge when seeking employment after graduation.

Career-focused

Prospective employers were consulted in this program’s design to ensure career success after graduation. Our Data Science program offers a middle ground between computer science and statistics, to ensure you are well prepared to obtain meaningful employment in data science-related positions in a wide range of industries.

You will take courses in computer programming (Python, R), communications, data visualization, data structures, machine learning, data analytics, ethics, and more. You will learn both the theoretical and applied perspectives of data science technologies and be knowledgeable in your stream subject. The capstone course will provide real-world experience that will open doors to internships and employment opportunities.

man in front of computer screen

For a preview of information on the Data Science program, please see the Data Science channel of the Come Study Math & Stats at York Discord Server


Who will you work with in Data Science?

Dr. Jairo Diaz-Rodriguez

Dr. Jairo Diaz-Rodriguez

Dr. Diaz-Rodriguez’s research interests are in data science, machine learning, high-dimensional statistics, optimization and big data. Most of his research contains both theoretical development and practical applications, with strong interdisciplinary components, and cloud and parallel computing implementations. He will be teaching MATH1130 in the fall term.

Dr. Kelly Ramsay

Dr. Kelly Ramsay

Dr. Ramsay works mainly in robust statistics and nonparametric statistics. Under this umbrella, she has worked on functional data, change-point problems and data privacy. Some of her areas of application are medical imaging, such as f-MRI, finance, and speech recognition. She has also worked on data collection and web-scraping in the past. She will be teaching MATH1130 in the winter term.

Dr. Kevin McGregor

Dr. Kevin McGregor

Dr. McGregor’s research has focused on biostatistics and various topics on statistical methods in genomics. In the past he has developed novel Bayesian methods to model community structure in the human microbiome by means of network and diversity estimation. He has also done work in methods for adjusting for cell-type heterogeneity in DNA methylation studies.

Dr. Steven Wang

Dr. Steven Wang

Dr. Wang has research experience in data science with applications to health science including biostatistics and bioinformatics. His current research is focused on data science for health care such as analysis of ECG, EEG and PPG data from wearable devices. He also has thirty years of consulting experience in data science.

Dr. Yuejiao Cindy Fu

Dr. Fu is the Statistics Director in the Department of Mathematics and Statistics. Her main research interests are mixture models, empirical likelihood, density ratio models, and statistical analysis of high-dimensional data, spatial data, genetic data, gene expression data and DNA methylation data.

Dr. Xin Gao

Dr. Xin Gao

Dr. Gao’s research areas include biostatistics, computational statistics, statistical theory, machine learning, high-dimensional data analysis, and data science. She have published over 60 publications in peer reviewed statistical journals. In the past, she has served as the president of the Southern Ontario Chapter of Statistical Society of Canada, associate editor of the Canadian Journal of Statistics, and a board member of the Statistical Society of Canada. She has collaborated with biomedical companies on various machine learning projects.

Amy Wu

Dr. Yuehua Amy Wu

Dr. Wu is an elected member of the International Statistical Institute. She has co-authored 130 papers, published in Proceedings of the National Academy of Sciences, the Journal of Applied Statistics, and the Journal of Multivariate Analysis, among others. Professor Wu's research centers on statistical theory and methods in data science and statistical machine learning. Her current research interests include robust statistics, model selection, multiple change-point analysis, multivariate analysis, spatio-temporal modeling, high-dimensional statistics, financial econometrics, data mining, and computational algorithms.