What is a bootcamp?

Our facilitated bootcamps focus on rapid skill acquisition by progressing you through a standard course on an accelerated schedule with peers who are committed to progressing on pace. Our bootcamps include:

  • Live kick-off event
  • Instructor facilitated Q&A for expert feedback and coaching
  • Learner Success Support: welcome call, advising sessions, personalized pace reminders
  • 24/7 help desk

About This Course:

In this bootcamp you will learn how data scientists exercise statistical thinking in designing data collection, derive insights from visualizing data, obtain supporting evidence for data-based decisions and construct models for predicting future trends from data.

This course can be used towards completion of a Professional Certificate in Data Science for Executives.

What You Will Learn:

  • Data collection, analysis and inference
  • Data classification to identify key traits and customers
  • Conditional Probability-How to judge the probability of an event, based on certain conditions
  • How to use Bayesian modeling and inference for forecasting and studying public opinion
  • Basics of Linear Regression
  • Data Visualization: How to create use data to create compelling graphics

Meet Your Instructors:

Andrew Gelman

Andrew Gelman is a professor of statistics and political science and director of the Applied Statistics Center at Columbia University. He has received the Outstanding Statistical Application award from the American Statistical Association, the award for best article published in the American Political Science Review, and the Council of Presidents of Statistical Societies award for outstanding contributions by a person under the age of 40. Andrew has done research on a wide range of topics, including: why it is rational to vote; why campaign polls are so variable when elections are so predictable; why redistricting is good for democracy; reversals of death sentences; police stops in New York City, the statistical challenges of estimating small effects; the probability that your vote will be decisive; seats and votes in Congress; social network structure; arsenic in Bangladesh; radon in your basement; toxicology; medical imaging; and methods in surveys, experimental design, statistical inference, computation, and graphics.
David Madigan - Pearson Advance

David Madigan

David Madigan received a bachelor’s degree in Mathematical Sciences and a Ph.D. in Statistics, both from Trinity College Dublin. He has previously worked for AT&T Inc., Soliloquy Inc., the University of Washington, Rutgers University, and SkillSoft, Inc. He has over 100 publications in such areas as Bayesian statistics, text mining, Monte Carlo methods, pharmacovigilance and probabilistic graphical models. He is an elected Fellow of the American Statistical Association and of the Institute of Mathematical Statistics. He recently completed a term as Editor-in-Chief of Statistical Science.

Lauren Hannah

Lauren Hannah is an Assistant Professor in the Department of Statistics at Columbia University. Dr. Hannah received a Ph.D. in Operations Research and Financial Engineering from Princeton University, and an A.B. in Classics, again from Princeton University. After completing her Ph.D., Dr. Hannah completed a postdoc at Duke in the Statistical Science Department. Her interests include machine learning, Bayesian statistics, and energy applications.

Eva Ascarza

Eva Ascarza is an Assistant Professor of Marketing at Columbia Business School. She is a marketing modeler who uses tools from statistics and economics to answer marketing questions. Her main research areas are customer analytics and pricing in the context of subscription businesses. She specializes in understanding and predicting changes in customer behavior, such as customer retention and usage. Another stream of her research focuses on developing statistical methodologies to be used by marketing practitioners. She received her PhD from London Business School (UK) and a MS in Economics and Finance from Universidad de Navarra (Spain).

What is a bootcamp?

Our facilitated bootcamps focus on rapid skill acquisition by progressing you through a standard course on an accelerated schedule with peers who are committed to progressing on pace. Our bootcamps include:

  • Live kick-off event
  • Instructor facilitated Q&A for expert feedback and coaching
  • Learner Success Support: welcome call, advising sessions, personalized pace reminders
  • 24/7 help desk

About This Course:

This bootcamp has a focus on learning the most commonly used project management methodologies in the IT field, and why they are effective. This bootcamp introduces you to project management standards and frameworks that increase efficiency and deliver tangible business benefits to IT projects.

Topics include:

  • Relationships among projects, programs and portfolios
  • Organizational culture and project management roles
  • Project management methods and lifecycles and their applications

This course can be used towards completion of a Professional Certificate in IT Project Management.

What You Will Learn:

By the end of this course, you will be able to:

  • Explain why organizations use project management to deliver business value
  • Describe the relationships among projects, programs and portfolios
  • Define the differences between predictive, iterative and Agile-based lifecycles
  • Analyze different project management roles – project manager, sponsor, stakeholder, ScrumMaster, product owner and developer
  • Understand how organizational culture can influence the role of the project manager

Meet Your Instructor:

Debra Hildebrand

Instructor of Project Management at University of Washington
Debra Hildebrand has over 15 years’ experience in project management consulting and training for private firms and public agencies (federal, state and local level). Her specialties include strategic planning and implementation, organizational redesign and restructure, project management, and quality assurance oversight. Certified as a Project Management Professional (PMP) by the Project Management Institute, Debra is an instructor at the University of Washington, Stanford University, and City University of Seattle. She has an MBA from the Columbia Business School and is the principal of Hildebrand Solutions, a project management training and consultation firm.

What is a Bootcamp?

Our facilitated bootcamps focus on rapid skill acquisition by progressing you through a standard course on an accelerated schedule with peers who are committed to progressing on pace. Our bootcamps include:

  • Live kick-off event
  • Instructor facilitated Q&A for expert feedback and coaching
  • Learner Success Support: welcome call, advising sessions, personalized pace reminders
  • 24/7 help desk

About This Course:

In this bootcamp we start by learning the key project management processes, roles, mechanics, and philosophies behind Scrum. This will provide the basis for all understanding Agile in its purest form, exploring Why, Who, How, and finally What Scrum looks like applied in the real world. From understanding the agile team members, like scrum master and product owner, to the important differences in lean and agile processes

Scrum and Agile are often considered synonymous, and there is a good reason. Scrum embodies the simplest and most pure approach to managing project work at the team level. Scrum is employed by over half of all Agile practitioners across all industries. While agile may have started in software development, many industries now use an agile methodology to deliver their work. Development teams around the world are now using Kanban boards and assigning strong product owners to direct self-organizing teams to deliver on prioritized product backlogs. And nearly every new product has some sort of IT component and goes through an agile development lifecycle.

  • Today nearly 100% of IT organizations use Agile and many other industries are quickly following
  • The likelihood of being on a Scrum or Scrum-like project is quickly approaching 50/50 or better over time.

While this course will not make you an agile certified practitioner (PMI-ACP), or certified scrum master (CSM), it offers a more fundamental agile certification based on agile principles and how scaled agile is applied in industry today.

Upon successful completion of this course, learners can earn 10 Professional Development Unit (PDU) credits, which are recognized by the Project Management Institute (PMI). PDU credits are essential to those looking to maintain certification as a Project Management Professional (PMP).

This course can be used towards completion of a Professional Certificate in Agile Project Management.

What You Will Learn:

  • Why Agile is taking over: history, case studies, and proof Agile works better
  • Who uses Agile based on industry scale, stakeholders, and engineering
  • How to run a successful Scrum team for speed, innovation, leadership, and control
  • Scrum team makeup, user story writing, sprint planning, execution, and retro tools
  • What Scrum looks like at scale, its alternatives, and how to avoid pitfalls over time

Meet Your Instructor:

John Johnson

Adjunct Professor, Clarke School of Engineering, College Park; Chief Technology Officer, Softek Enterprises LLC at
Mr. Johnson, serves as the Chief Technology Officer for Softek Enterprises LLC, a minority-owned small business providing technology solutions to government clients since 2007. Softek specializes in evolving business systems using Agile, DevOps, and Cloud technologies to deliver working solutions faster for the government’s most critical IT challenges. He has 10 years of project management, systems engineering, and advanced analytics experience. Prior to joining Softek, Mr. Johnson co-founded Second Nature Software LLC, a data science products company focused on Life Science Research. He helped design and promote their product “Rocketfish,” a data management tool that simplifies preparing data for analysis while automating data tracking and organization. Rocketfish is currently in an organization-wide trial at NCI and NIAID, as well as major universities in the DC Metro Area. Previously, Mr. Johnson was a Senior Agile Project Manager with IBM, where he led multiple development teams building applications for the National Archives Records Administration (NARA). These applications were built on Amazon’s Gov Cloud (AWS) with cutting-edge cloud technologies to process, store, and search the hundreds of petabytes of government records expected at NARA by 2020. This project won “Project of the Year” across all of IBM globally for its success in project management innovation. He also worked as a Management Consultant with Booz Allen Hamilton, where he led projects for the Marine Corps, Air Force, and Navy from optimizing site investments and posture for Reserve forces, to developing award-winning project analysis and portfolio management software to optimize billions in shore energy investments. Mr. Johnson holds a Masters in Systems Engineering and a B.S. in Civil Engineering from the University of Maryland.

What is a Bootcamp?

Our facilitated bootcamps focus on rapid skill acquisition by progressing you through a standard course on an accelerated schedule with peers who are committed to progressing on pace. Our bootcamps include:

  • Live kick-off event
  • Instructor facilitated Q&A for expert feedback and coaching
  • Learner Success Support: welcome call, advising sessions, personalized pace reminders
  • 24/7 help desk

About This Course:

This bootcamp covers the basics of data visualization and exploratory data analysis. We will use three motivating examples and ggplot2; a data visualization package for the statistical programming language R. We will start with simple datasets and then graduate to case studies about world health, economics, and infectious disease trends in the United States.

We’ll also be looking at how mistakes, biases, systematic errors, and other unexpected problems often lead to data that should be handled with care.

The fact that it can be difficult or impossible to notice a mistake within a dataset makes data visualization particularly important. The growing availability of informative datasets and software tools has led to increased reliance on data visualizations across many areas. Data visualization provides a powerful way to communicate data-driven findings, motivate analyses, and detect flaws.

This course can be used towards completion of a Professional Certificate in Data Science.

What You Will Learn:

  • Data visualization principles
  • How to communicate data-driven findings
  • How to use ggplot2 to create custom plots
  • The weaknesses of several widely used plots and why you should avoid them

Meet Your Instructors:

Rafael Irizarry

Professor of Biostatistics at Harvard University
Rafael Irizarry is a Professor of Biostatistics at the Harvard T.H. Chan School of Public Health and a Professor of Biostatistics and Computational Biology at the Dana Farber Cancer Institute. For the past 15 years, Dr. Irizarry’s research has focused on the analysis of genomics data. During this time, he has also has taught several classes, all related to applied statistics. Dr. Irizarry is one of the founders of the Bioconductor Project, an open source and open development software project for the analysis of genomic data. His publications related to these topics have been highly cited and his software implementations widely downloaded.

Leonardo Palomera

Leonardo Palomera, through his professional and academic experiences, has become a specialist in data analytics for a variety of subjects including education, statistics, economics, and finance. Currently Leonardo works as a Data Scientist with Pearson Advance to support our goal of personalizing support that leads to high course completion rates. Additionally, Leonardo has held teaching positions at University of Southern California (USC), University of California, Los Angeles (UCLA), University of Colorado, Boulder, University of Denver, and California State University, Long Beach (CSULB). His goal is to empower students to gain the knowledge and skills they need to conduct robust analytics on a host of real-world problems.

Frequently Asked Questions:

Honor code statement
HarvardX requires individuals who enroll in its courses on edX to abide by the terms of the edX honor code. HarvardX will take appropriate corrective action in response to violations of the edX honor code, which may include dismissal from the HarvardX course; revocation of any certificates received for the HarvardX course; or other remedies as circumstances warrant. No refunds will be issued in the case of corrective action for such violations. Enrollees who are taking HarvardX courses as part of another program will also be governed by the academic policies of those programs.

Research statement
By registering as an online learner in our open online courses, you are also participating in research intended to enhance HarvardX’s instructional offerings as well as the quality of learning and related sciences worldwide. In the interest of research, you may be exposed to some variations in the course materials. HarvardX does not use learner data for any purpose beyond the University’s stated missions of education and research. For purposes of research, we may share information we collect from online learning activities, including Personally Identifiable Information, with researchers beyond Harvard. However, your Personally Identifiable Information will only be shared as permitted by applicable law, will be limited to what is necessary to perform the research, and will be subject to an agreement to protect the data. We may also share with the public or third parties aggregated information that does not personally identify you. Similarly, any research findings will be reported at the aggregate level and will not expose your personal identity.

Please read the edX Privacy Policy for more information regarding the processing, transmission, and use of data collected through the edX platform.

Nondiscrimination/anti-harassment statement
Harvard University and HarvardX are committed to maintaining a safe and healthy educational and work environment in which no member of the community is excluded from participation in, denied the benefits of, or subjected to discrimination or harassment in our program. All members of the HarvardX community are expected to abide by Harvard policies on nondiscrimination, including sexual harassment, and the edX Terms of Service. If you have any questions or concerns, please contact harvardx@harvard.edu and/or report your experience through the edX contact form.

What is a Bootcamp?

Our facilitated bootcamps focus on rapid skill acquisition by progressing you through a standard course on an accelerated schedule with peers who are committed to progressing on pace. Our bootcamps include:

  • Live kick-off event
  • Instructor facilitated Q&A for expert feedback and coaching
  • Learner Success Support: welcome call, advising sessions, personalized pace reminders
  • 24/7 help desk

About This Course:

Kickstart your learning of Python for data science, as well as programming in general with this introduction to Python course. This beginner-friendly Python bootcamp will quickly take you from zero to programming in Python in a matter of hours and give you a taste of how to start working with data in Python.

Upon its completion, you’ll be able to write your own Python scripts and perform basic hands-on data analysis using our Jupyter-based lab environment.

You can start creating your own data science projects and collaborating with other data scientists using IBM Watson Studio. When you sign up, you will receive free access to Watson Studio. Start now and take advantage of this platform and learn the basics of programming, machine learning, and data visualization with this introductory course.

What You Will Learn:

  • What Python is and why it is useful
  • The application of Python to Data Science
  • How to define variables in Python
  • Sets and conditional statements in Python
  • The purpose of having functions in Python
  • How to operate on files to read and write data in Python
  • How to use pandas, a must have package for anyone attempting data analysis in Python.

Meet Your Instructor:

Joseph Santarcangelo

PhD., Data Scientist at IBM

Joseph Santarcangelo is currently working as a Data Scientist at IBM. Joseph has a Ph.D. in Electrical Engineering. His research focused on using machine learning, signal processing, and computer vision to determine how videos impact human cognition.

What is a Bootcamp?

Our facilitated bootcamps focus on rapid skill acquisition by progressing you through a standard course on an accelerated schedule with peers who are committed to progressing on pace. Our bootcamps include:

  • Live kick-off event
  • Instructor facilitated Q&A for expert feedback and coaching
  • Learner Success Support: welcome call, advising sessions, personalized pace reminders
  • 24/7 help desk

About This Course:

This bootcamp will introduce you to the basics of R programming. You can better retain R when you learn it to solve a specific problem, so you’ll use a real-world dataset about crime in the United States. You will learn the R skills needed to answer essential questions about differences in crime across the different states.

We’ll cover R’s functions and data types, then tackle how to operate on vectors and when to use advanced functions like sorting. You’ll learn how to apply general programming features like “if-else,” and “for loop” commands, and how to wrangle, analyze and visualize data.

We help you develop a skill set that includes R programming, data wrangling with dplyr, data visualization with ggplot2, file organization with UNIX/Linux, version control with git and GitHub, and reproducible document preparation with RStudio.

This course can be used towards completion of a Professional Certificate in Data Science .

What You Will Learn:

  • Basic R syntax
  • Foundational R programming concepts such as data types, vectors arithmetic, and indexing
  • How to perform operations in R including sorting, data wrangling using dplyr, and making plots

Meet Your Instructors:

Rafael Irizarry

Professor of Biostatistics at Harvard University
Rafael Irizarry is a Professor of Biostatistics at the Harvard T.H. Chan School of Public Health and a Professor of Biostatistics and Computational Biology at the Dana Farber Cancer Institute. For the past 15 years, Dr. Irizarry’s research has focused on the analysis of genomics data. During this time, he has also has taught several classes, all related to applied statistics. Dr. Irizarry is one of the founders of the Bioconductor Project, an open source and open development software project for the analysis of genomic data. His publications related to these topics have been highly cited and his software implementations widely downloaded.

Leonardo Palomera

Leonardo Palomera, through his professional and academic experiences, has become a specialist in data analytics for a variety of subjects including education, statistics, economics, and finance. Currently Leonardo works as a Data Scientist with Pearson Advance to support our goal of personalizing support that leads to high course completion rates. Additionally, Leonardo has held teaching positions at University of Southern California (USC), University of California, Los Angeles (UCLA), University of Colorado, Boulder, University of Denver, and California State University, Long Beach (CSULB). His goal is to empower students to gain the knowledge and skills they need to conduct robust analytics on a host of real-world problems.

Frequently Asked Questions:

Honor code statement
HarvardX requires individuals who enroll in its courses on edX to abide by the terms of the edX honor code. HarvardX will take appropriate corrective action in response to violations of the edX honor code, which may include dismissal from the HarvardX course; revocation of any certificates received for the HarvardX course; or other remedies as circumstances warrant. No refunds will be issued in the case of corrective action for such violations. Enrollees who are taking HarvardX courses as part of another program will also be governed by the academic policies of those programs.

Research statement
By registering as an online learner in our open online courses, you are also participating in research intended to enhance HarvardX’s instructional offerings as well as the quality of learning and related sciences worldwide. In the interest of research, you may be exposed to some variations in the course materials. HarvardX does not use learner data for any purpose beyond the University’s stated missions of education and research. For purposes of research, we may share information we collect from online learning activities, including Personally Identifiable Information, with researchers beyond Harvard. However, your Personally Identifiable Information will only be shared as permitted by applicable law, will be limited to what is necessary to perform the research, and will be subject to an agreement to protect the data. We may also share with the public or third parties aggregated information that does not personally identify you. Similarly, any research findings will be reported at the aggregate level and will not expose your personal identity.

Please read the edX Privacy Policy for more information regarding the processing, transmission, and use of data collected through the edX platform.

Nondiscrimination/anti-harassment statement
Harvard University and HarvardX are committed to maintaining a safe and healthy educational and work environment in which no member of the community is excluded from participation in, denied the benefits of, or subjected to discrimination or harassment in our program. All members of the HarvardX community are expected to abide by Harvard policies on nondiscrimination, including sexual harassment, and the edX Terms of Service. If you have any questions or concerns, please contact harvardx@harvard.edu and/or report your experience through the edX contact form.