Skip to content

Practical exercises and resources from the 12-course IBM Data Science Professional Certificate program, tackling DS fundamentals, machine learning, data analysis, and visualization using Python

Notifications You must be signed in to change notification settings

jhermienpaul/ibm-data-science-program

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

IBM Data Science Professional Certificate

Prepare for a career as a data scientist. Build job-ready skills – and must-have AI skills – for an in-demand career.

Coursera: IBM Data Science Professional Certificate

Certificate

Verify this certificate on Credly


📖 What you'll learn

  • Master the most up-to-date practical skills and knowledge that data scientists use in their daily roles
  • Learn the tools, languages, and libraries used by professional data scientists, including Python and SQL
  • Import and clean data sets, analyze and visualize data, and build machine learning models and pipelines
  • Apply your new skills to real-world projects and build a portfolio of data projects that showcase your proficiency to employers

📈 Skills you'll gain

Data Science Data Analytics ETL Data Wrangling Data Modeling Data Analysis Statistics Correlation A/B Testing Machine Learning Feature Engineering Dimensionality Reduction PCA tSNE and UMAP Linear Regression Logistic Regression Ridge Regression Lasso Regression Support Vector Machine KNN Naive Bayes Model K-Means Hierarchical Clustering DBSCAN Decision Trees Random Forest XGBoost Recommender Systems Hyperparameter Tuning Grid Search Random Search Bayesian Optimization Data Visualization Data Storytelling Report Automation Dashboard Development SQL IBM Cognos Analytics Python IBM Db2 JupyterLab

🏆 Endorsements and recognition

  • ACE® College Credit Recommendation: Earn up to 12 US semester credits upon completion (via ACE® and Coursera)
  • FIBAA Certified: Recognized for 6 ECTS credits by European institutions
  • IBM Digital Badge: Receive a verified IBM Data Science Professional Certificate badge via Credly
  • Career Support: Access IBM’s Talent Network with personalized job opportunities, skill-based recommendations, and interview tips

📚 Courses and lessons

  1. What is Data Science?

    • Define data science and its importance in today’s data-driven world.
    • Describe the various paths that can lead to a career in data science.
    • Summarize advice given by seasoned data science professionals to data scientists who are just starting out.
    • Explain why data science is considered the most in-demand job in the 21st century.
  2. Tools for Data Science

    • Describe the Data Scientist’s tool kit which includes: Libraries & Packages, Data sets, Machine learning models, and Big Data tools
    • Utilize languages commonly used by data scientists like Python, R, and SQL
    • Demonstrate working knowledge of tools such as Jupyter notebooks and RStudio and utilize their various features
    • Create and manage source code for data science using Git repositories and GitHub.
  3. Data Science Methodology

    • Describe what a data science methodology is and why data scientists need a methodology.
    • Apply the six stages in the Cross-Industry Process for Data Mining (CRISP-DM) methodology to analyze a case study.
    • Evaluate which analytic model is appropriate among predictive, descriptive, and classification models used to analyze a case study.
    • Determine appropriate data sources for your data science analysis methodology.
  4. Python for Data Science, AI & Development

    • Develop a foundational understanding of Python programming by learning basic syntax, data types, expressions, variables, and string operations.
    • Apply Python programming logic using data structures, conditions and branching, loops, functions, exception handling, objects, and classes.
    • Demonstrate proficiency in using Python libraries such as Pandas and Numpy and developing code using Jupyter Notebooks.
    • Access and extract web-based data by working with REST APIs using requests and performing web scraping with BeautifulSoup.
  5. Python Project for Data Science

    • Play the role of a Data Scientist / Data Analyst working on a real project.
    • Demonstrate your Skills in Python - the language of choice for Data Science and Data Analysis.
    • Apply Python fundamentals, Python data structures, and working with data in Python.
    • Build a dashboard using Python and libraries like Pandas, Beautiful Soup and Plotly using Jupyter notebook.
  6. Databases and SQL for Data Science (with Python)

    • Analyze data within a database using SQL and Python.
    • Create a relational database and work with multiple tables using DDL commands.
    • Construct basic to intermediate level SQL queries using DML commands.
    • Compose more powerful queries with advanced SQL techniques like views, transactions, stored procedures, and joins.
  7. Data Analysis with Python

    • Construct Python programs to clean and prepare data for analysis by addressing missing values, formatting inconsistencies, normalization, and binning
    • Analyze real-world datasets through exploratory data analysis (EDA) using libraries such as Pandas, NumPy, and SciPy to uncover patterns and insights
    • Apply data operation techniques using dataframes to organize, summarize, and interpret data distributions, correlation analysis, and data pipelines
    • Develop and evaluate regression models using Scikit-learn, and use these models to generate predictions and support data-driven decision-making
  8. Data Visualization with Python

    • Implement data visualization techniques and plots using Python libraries, such as Matplotlib, Seaborn, and Folium to tell a stimulating story
    • Create different types of charts and plots such as line, area, histograms, bar, pie, box, scatter, and bubble
    • Create advanced visualizations such as waffle charts, word clouds, regression plots, maps with markers, & choropleth maps
    • Generate interactive dashboards containing scatter, line, bar, bubble, pie, and sunburst charts using the Dash framework and Plotly library
  9. Machine Learning with Python

    • Explain key concepts, tools, and roles involved in machine learning, including supervised and unsupervised learning techniques.
    • Apply core machine learning algorithms such as regression, classification, clustering, and dimensionality reduction using Python and scikit-learn.
    • Evaluate model performance using appropriate metrics, validation strategies, and optimization techniques.
    • Build and assess end-to-end machine learning solutions on real-world datasets through hands-on labs, projects, and practical evaluations.
  10. Applied Data Science Capstone

    • Demonstrate proficiency in data science and machine learning techniques using a real-world data set and prepare a report for stakeholders
    • Apply your skills to perform data collection, data wrangling, exploratory data analysis, data visualization model development, and model evaluation
    • Write Python code to create machine learning models including support vector machines, decision tree classifiers, and k-nearest neighbors
    • Evaluate the results of machine learning models for predictive analysis, compare their strengths and weaknesses and identify the optimal model
  11. Generative AI: Elevate Your Data Science Career

    • Leverage generative AI tools, like GPT 3.5, ChatCSV, and tomat.ai, available to Data Scientists for querying and preparing data
    • Examine real-world scenarios where generative AI can enhance data science workflows
    • Practice generative AI skills in hand-on labs and projects by generating and augmenting datasets for specific use cases
    • Apply generative AI techniques in the development and refinement of machine learning models
  12. Data Scientist Career Guide and Interview Preparation

    • Describe the role of a data scientist and some career path options as well as the prospective opportunities in the field.
    • Explain how to build a foundation for a job search, including researching job listings, writing a resume, and making a portfolio of work.
    • Summarize what a candidate can expect during a typical job interview cycle, different types of interviews, and how to prepare for interviews.
    • Explain how to give an effective interview, including techniques for answering questions and how to make a professional personal presentation.

🚀 How to use this repo

This repo is open source! Feel free to:

  • 👀 Browse the course readings, exercises, and case studies
  • 💻 Fork/clone for your own self-study or review
  • 🤝 Collaborate by submitting issues or improvements via pull requests
  • 🌟 Get inspired if you’re preparing to be a data professional or want to level up your data skills

Disclaimer: All content is for educational purposes only and is shared to help aspiring data professionals. Please don’t submit this work as your own in graded assessments—let’s keep it ethical!


✨ I’m always open to networking, collaboration, or sharing insights ✨
Don’t be shy — connect with me on LinkedIn! 👋

LinkedIn Badge