Winter 2024 (2 units)
Survey of Data Science methods in Python, starting with common data science tools and processes and spending one week per topics learning to build common ML/AI solutions.
Objectives
At the conclusion of this course, students will be able to:
- Develop Proficiency in Python Programming and Data Science Tools: Equip students with the skills to proficiently use essential data science technologies and processes common in industry; including Python, Git, SQL, Pandas, and data visualization libraries, providing a strong foundation for conducting data analysis in real-world scenarios.
- Apply Machine Learning to Solve Real-World Problems: Enable students to apply machine learning techniques including data cleaning, classification and time series analysis, to real-world data science challenges, emphasizing their practical application in fields like health data science.
- Implement Advanced Data Science Concepts: Empower students to implement advanced data science solutions, including generative AI and Large Language Model (LLM) development, with a focus on their relevance and applications in healthcare and health data analysis.
- Develop Effective Data Communication Skills: Enhance students' ability to communicate data insights effectively through data visualization and storytelling, a crucial skill for conveying findings in health data science contexts.
Prerequisites
Familiarity with programming concepts, including loops, variables, and functions. Ideally, hands-on experience writing and running scripts such as in: Python, R, Bash, or other programming languages. Exceptions to this prerequisite may be made with the consent of the Course Director, space permitting. This course is part of the Health Data Science Masters and Certificate Program and may have space limitations. Auditing is not permitted.
Exceptions to this prerequisite may be made with the consent of the Course Director, space permitting.
Faculty
Course Director: | Christopher Seaman, MA Title: email: [email protected] |
Format
Lectures will be held on Mondays, 3:00 - 5:00 PM, January 8 through March 18. Labs will be held on Fridays, 9:00 - 10:30 PM.
Each week will dive into a different focus area. The lecture will provide an overview of new concepts and tools introduced that week, closing with a hands-on exercise for students to apply those concepts while writing their own code. Labs will not introduce new material; instead, they provide a forum for collaboration between students and staff to help each other with the current material.
Students are encouraged to collaborate in small groups but may also work independently. The class culminates with a larger, multi-week project applying advanced techniques or combining applications from multiple focus areas.
All course materials and handouts will be posted on the course's online syllabus.
Materials
- Syllabus & lecture notes/slides
- Reference books (including freely available options)
- Tools:
- Markdown
- Python
- Git + GitHub
- Jupyter notebooks or Google Colab
- Visual Studio Code
Grading
Final grades will be based on the class participation (40%) and submitted exercises (60%).
Only UCSF students (defined as individuals enrolled in UCSF degree or certificate programs) will receive academic credit for courses. Official transcripts are available to UCSF students only. A Certificate of Course Completion will be available upon request to individuals who are not UCSF students and satisfactorily pass all course requirements.
To Enroll
Matriculated students (ATCR/MAS/CiHDaS/MiHDaS/PhD) use the Student Portal.
Course Fees
How to pay (please read before applying)
Only one application needs to be completed for all courses desired during the quarter.
Apply by January 15, 2024 (deadline extended)