đ Syllabus
Table of contents
- About đ§
- Getting Started đ»
- Communication đŹ
- Course Components đ
- Exams đ
- Policies đŻ
- Collaboration Policy and Academic Integrity đ€
- Support đ«
- Acknowledgements đ
About đ§
DSC 80 serves as a bridge between lower-division and upper-division data science courses. In DSC 80, students will gain proficiency with the data science life cycle and learn many of the fundamental principles and techniques of data science spanning algorithms, statistics, machine learning, visualization, and data systems.
After DSC 80, students will be prepared for data science internships and interviews, will have the tools to create their own data science portfolios, and will have the maturity necessary to succeed in upper-division machine learning and statistics courses.
Prerequisites: DSC 30 and DSC 40A.
Getting Started đ»
The course website, dsc80.com, will contain links to all course content. There are also a few things youâll need to do to get set up.
Websites
Youâll need to make accounts on the following sites:
Campuswire: Weâll be using Campuswire as our course message and discussion board. More details are in the Communication section below. If you didnât already get an invitation to our Campuswire course, sign up here with join code 4659.
Gradescope: Youâll submit all assignments and exams to Gradescope. This is where all of your grades will live as well. Most of the assignments will be coding assignments. Parts of these assignments will be manually graded, but most of them will be autograded. You should have received an email invitation for Gradescope, but if not please let us know as soon as possible via Campuswire.
GitHub: Like in DSC 30, youâll access all course content (lecture slides and assignments) by pulling our course GitHub repository. The link to the repo is here. In most assignments, you wonât need to push anything to GitHub. However, you will need to push to GitHub as part of your Final Project, so youâll need to have an account by then.
Note that we will not be using Canvas for anything this quarter.
Development Environment
As soon as you are able to, go follow the steps in the Tech Support page of the course website to set up your development environment for the course. The first discussion section on Wednesday, January 7th is designed to help you get your environment set up so you can complete the assignments.
Forms
Please fill out the Welcome Survey to tell us a bit more about yourself and notify us about any exam conflicts.
Communication đŹ
This quarter, weâll be using Campuswire as our course message board. If you didnât already get an invitation to our Campuswire course, sign up here with join code 4659.
If you have a question about anything to do with the course â if youâre stuck on a problem, didnât understand something from lecture, want clarification on course logistics, or just have a general question about data science â you can make a post on Campuswire. We only ask that if your question includes some or all of an answer (even if youâre not sure itâs right), please make your post private so that others cannot see it. You can also post anonymously to other students if you prefer.
Course staff will regularly check Campuswire and try to answer any questions that you have. Youâre also encouraged to answer questions asked by other students. Explaining something is a great way to solidify your understanding of it!
Please donât email or DM individual staff members. Make a private or public Campuswire post instead.
Course Components đ
Lectures
Lectures will be held in-person on Tuesdays and Thursdays from 9:30-10:50AM and 11AM-12:20PM in PODEM 1A18 Attendance is not required, though you are encouraged to attend in-person if you are able to. Lectures will be podcasted.
Lecture notebooks will be your main resource in this class. You can access them, along with all course materials, by pulling from the course GitHub repository. We will also link HTML previews of each lecture notebook from the course homepage; you can use these to annotate the lecture notebooks with a tablet, if youâd like.
Supplementary readings will primarily come from Learning Data Science, a textbook written by Sam Lau, Joey Gonzalez, and Deb Nolan. It is available for free online at learningds.org. Some readings will come from other free online sources, including notes.dsc80.com, a set of course notes that were written to supplement DSC 80 when the course was first created.
You are responsible for all material included in the posted lecture notebooks, whether or not we have time to discuss all of that material in lecture. You may also be tested on material covered only in assignments. You will not be tested on anything that appears only in the readings but not in lectures or assignments, but you should still complete the readings to boost your understanding!
Discussions and Lab Reflections
Discussions will be held in-person on Wednesdays from 1-1:50PM and 2-2:50PM in CSB 002. Discussion sections will be podcasted.
Youâll spend the vast majority of your time in this course on labs and projects, which youâll read more about in the sections below. The labs you complete each week will give you hands-on practice with the tools and techniques introduced in lectures. While completing the labs is important, itâs also important to reflect on your lab work once grades are released, and think about how you could have approached problems differently (e.g. more efficiently).
Therefore, in discussion sections on Wednesdays, the TA will discuss solutions to a subset of the lab that was due that Monday (2 days prior). When you attend, youâll have a chance to discuss your implementation with course staff and hear how others attempted the problems. Hopefully, youâll leave with a stronger understanding of the learning objectives of the lab. Attending discussion and reflecting on your labs will also benefit you since youâll be allowed to resubmit labs for some additional credit via a redemption policy, described in the next section.
Labs and Lab Redemption
There will be 9 lab assignments due weekly throughout the quarter. Each lab assignment will consist of a mixture of coding and free response questions. Coding questions will ask you to fill in the body of a function. Public tests are usually provided so that you can make sure that you're on the right track (similar to DSC 20), however, your submission will be graded using a private autograder with hidden tests. You will access labs (and projects) by pulling the course GitHub repository.
Each lab is worth the same amount, but the lowest lab will be dropped when calculating your final score. Labs will usually be due Mondays at 11:59PM, with the exception of Lab 5 which is due on a Sunday, because of its proximity to the Midterm Exam.
We will also have a lab redemption policy to encourage you to reflect on your solutions to the lab assignments and attend the discussion sections. On Wednesday morning, before the discussion section, we will publish the autograder results for the lab that was just due Monday. You may then review how you did and resubmit the assignment by the next day, Thursday at 11:59PM. Your score for the lab will be the average of your original submissionâs score and your resubmissionâs score. If you forget to turn in a lab or otherwise miss the original deadline, you can also turn in your work for the resubmission only, which will be averaged with your original score (0), essentially giving you half credit as a late penalty.
Example: Sam submits Lab 1 on the due date, Monday. On Wednesday, he sees his score, which is a 90% since he lost points on a few hidden tests. He fixes the errors and resubmits on Thursday, and his new submission gets 100%. His final recorded grade for Lab 1 is 95% (since (90% + 100%) / 2 = 95%).
Note that this redemption policy applies to labs only, not projects or project checkpoints.
Projects
There will be 4 projects due throughout the quarter. Like labs, projects consist of coding and free response questions. As their name implies, however, projects are more open-ended and allow you to simulate applying your data science skills in practical situations. You can think of the projects as being mini-take-home-exams that track your practical skills throughout the quarter (whereas the exams themselves test for conceptual understanding).
Projects are due bi-weekly. However, the week before a project is due, there will often be a project checkpoint. This checkpoint will ensure that you're on-track to complete the project on time, and should (hopefully) be a source of easy points.
The Final Project (Project 4) will be due during finals week and can be thought of as a practical component of the Final Exam.
Note that, unlike labs, the lowest project score is not dropped. Projects and project checkpoints will usually be due on Fridays at 11:59PM.
Working in Pairs
You may work together on projects (and projects only!) with a partner. If you work with a partner, you are both required to actively contribute to all parts of the project. You must both be working on the assignment at the same time together, either physically or virtually on a Zoom call. You are encouraged to follow the pair programming model, in which you work on just a single computer and alternate who writes the code and who thinks about the problems at a high level.
In particular, you cannot split up the project and each work on separate parts independently.
If you work with a partner:
- Only one partner needs to submit the project on Gradescope; this partner should add the other partner to their submission.
- You must also submit the checkpoint together.
- You and your partner will receive the same score on any submissions you make together.
If you are unhappy with your partnership (e.g., if your partner does not keep in touch, does not come prepared to work on the assignment, or does not seem to be engaged in the process), please first address your concerns to your partner, and try to agree on what should be done to make the partnership work well for both of you. If that approach is not successful, explain the issues to the instructors, who will work with you and your partner to improve the situation.
You may use different partners on different projects.
Note that you may not work with partners on lab assignments, however youâre encouraged to discuss all assignments with others at a conceptual level in office hours and study groups.
Office Hours
To get help on assignments and concepts, course staff will be hosting several office hours per week. All office hours will be held in person in HDSI 155. See the Calendar tab of the course website for the most up-to-date schedule and instructions.
Weekly Schedule
To summarize all of the events and deadlines, refer to this general weekly schedule (which is subject to change in any given week):
| Sunday | Monday | Tuesday | Wednesday | Thursday | Friday | Saturday |
|---|---|---|---|---|---|---|
| Â | Â | Lecture | Â | Lecture | Â | Â |
| Â | Â | Â | Discussion | Â | Â | Â |
| Â | Lab due | Â | Â | Lab resubmission | Project/checkpoint due | Â |
Exams đ
This class has one Midterm Exam and one Final Exam. Exams are cumulative, though the Final Exam will emphasize material after the Midterm Exam.
Midterm Exam: Tuesday, February 10th (Week 6) during lecture
Final Exam: Saturday, March 14th, 8-11AM
Both exams will be administered in-person. If you have conflicts with either of the exams, please let us know on the Welcome Survey.
Exam Redemption
The Final Exam will consist of two parts: a âMidtermâ section and a âpost-Midtermâ section. If you do better on the âMidtermâ section of the Final Exam than you did on the original Midterm Exam, your score on the âMidtermâ section will replace your original Midterm Exam score. This lowers the stakes of the Midterm Exam and gives you two opportunities to demonstrate your understanding of the content from the first half of the course. This also means that you can miss the Midterm Exam for any reason and have the score be replaced by your score on the âMidtermâ section of the Final Exam (though we do not recommend this).
You must take the Final Exam to pass the course.
Policies đŻ
Grading
Here is how weâll compute your grade:
| Component | Weight | Notes |
|---|---|---|
| Labs | 20% | 2.5% per lab, lowest dropped |
| Projects | 25% | 5% each for Projects 1-3, 10% for Project 4 |
| Project Checkpoints | 5% | 1% each |
| Midterm Exam | 20% | see the Redemption Policy above |
| Final Exam | 30% | Â |
Late Policy, Slip Days, and Drops
All assignments must be submitted by 11:59PM San Diego time on the due date to be considered on time. You may turn them in as many times as you like before the deadline and only the most recent submission will count, so itâs a good habit to submit early and often. If you make a submission after the deadline, your assignment will be counted as late.
You have 5 âslip daysâ to use throughout the quarter on projects or project checkpoints. A slip day extends the deadline of an assignment by 24 hours. You may use up to 2 slip days for a single deadline, meaning no assignments will be accepted more than 48 hours after the deadline.
You may not use slip days on lab assignments. If you miss a deadline for a lab, you can use the resubmission policy to get 50% credit on the lab. We also drop your lowest lab score.
Slip days are designed to be a transparent and predictable source of leniency in deadlines. You can use a slip day if you are too busy to complete an assignment on its original due date (or if you forgot about it). But slip days are also meant for things like the internet going down at 11:58PM just as you go to submit your assignment. Slip days are meant to cover ânormalâ life circumstances, but if you have a serious illness, suffer a loss, or experience some other unfortunate event that severely disrupts your ability to participate in the class, please let us know ASAP!
Slip days are applied automatically at the end of the quarter, and you donât need to ask in order to use one. Itâs your responsibility to keep track of how many you have left. If youâve run out of slip days and submit an assignment late, that assignment may still be graded, but you will receive a 0 on it when we calculate grades at the end of the quarter. However, in the event that you use all 5 days and submit another assignment late, we will allocate your slip days to your projects in reverse-chronological order. This is done to prevent you from receiving a 0 on the Final Project if youâve run out of slip days and submit it late.
Regrade Requests
Most of the labs and projects are autograded, but some questions are manually graded. If you feel that there in an error in the autograder or that the manual grader has made a mistake, you may submit a regrade request within three days of the grades being released. If you do not submit a regrade request within this time frame, your original grade will be final.
Regrade Requests for Manually Graded Problems
To submit a regrade request for a manually graded problem, make the regrade request directly on Gradescope. Note that part of your grade is clarity, so if your answer was mostly right but unclear you may still not be eligible for full credit.
Regrade Requests for Autograded Problems
To submit an autograder regrade request, please make a private post to all instructors on Campuswire explaining the issue, including screenshots where appropriate.
The autograder is very picky: it expects your assignments to have exactly the correct file names, all functions must be named correctly, etc. If these are wrong, your code may not run and the autograder may assign zero points. This is a grading catastrophe đ§.
Grading catastrophes are preventable! After submitting your assignment, always wait around to see the output of the Gradescope grader and ensure that it runs properly. Also, be sure to submit your assignment (or at least part of it) to Gradescope with enough time before the deadline to get help if there is a strange autograder problem.
In the case that you submit code that doesnât run and discover this at a later date, you have some options:
- For a project or project checkpoint, if it is still before the slip day deadline, you may use slip days to fix your code and resubmit. Note that youâre free to do this even if your code runs â this is just making use of the normal slip day mechanism to submit an assignment late. Likewise, for a lab, if it is before the resubmission deadline, you can resubmit the lab as usual, and youâll get half credit because youâll have a 0 on the original submission.
- If it is past the slip day deadline and your code requires only minor fixes (e.g., the file name is wrong) we will fix your code at the cost of 2 slip days. Note that these slip days are in addition to any slip days you already used on the assignment. You can submit a catastrophe regrade request the same way you submit a regular autograder regrade request, via a private post on Campuswire.
Incompletes
In the unfortunate circumstance that you become sick, suffer a loss, or otherwise experience a significant setback that is outside of your control, you may be eligible for an Incomplete grade, which allows you to complete the rest of the work at a later time. If you are experiencing challenges due to circumstances outside your control, please contact me ASAP and we can discuss the best course of action. Note that an Incomplete does not allow you to re-do work that has already been completed, only to do work that hasnât been completed, so itâs best to reach out right away.
Letter Grades
We will use a standard scale for assigning letter grades:
| Letter Grade | A | A- | B+ | B | B- | C+ | C | C- | D | F |
| Percentage | 93+ | 90+ | 87+ | 83+ | 80+ | 77+ | 73+ | 70+ | 60+ | below 60 |
We may slightly lower these cutoffs, making it easier to earn a certain grade, but we will never raise the cutoffs. A+ grades are given at the instructorâs discretion. If you are taking the course P/NP, you will receive a grade of P if you meet the criteria for a C- grade, otherwise you will receive a grade of NP.
Collaboration Policy and Academic Integrity đ€
DSC 80 is known for being a rigorous but rewarding course. While you will be challenged this quarter, we will be offering you plenty of support through office hours and Campuswire. Make good use of these resources, and you will be able to succeed in this course.
There is no excuse for cheating in this course. If you do cheat, we will enforce the UCSD Policy on Integrity of Scholarship. This means you will likely fail the course and the Dean of your college will put you on probation or suspend or dismiss you from UCSD. Students agree that by taking this course, their assignments may be submitted to third-party software to help detect plagiarism.
Why is academic integrity important?
Academic integrity is an issue that is pertinent to all students on campus. When students act unethically by copying someoneâs work, taking an exam for someone else, plagiarizing, etc., these students are misrepresenting their academic abilities. This makes it impossible for instructors to give grades (and for the University to give degrees) that reflect student knowledge. This devalues the worth of a UCSD degree for all students, making it imperative for the the campus as a whole to enforce that all members of this community are honest and ethical. We want your degree to be meaningful and we want you to be proud to call yourself a graduate of UCSD!
The UCSD Policy on Integrity of Scholarship and this syllabus list some of the standards by which you are expected to complete your academic work, but your good ethical judgment (or asking us for advice) is also expected as we cannot list every behavior that is unethical or not in the spirit of academic integrity. Ignorance of the rules will not excuse you from any violations.
What counts as cheating?
In DSC 80, you can read books, surf the web, talk to your friends and the DSC 80 staff to get help understanding the concepts you need to know to complete your assignments. However, all code must be written by you (or, in the instance of projects, together with your partner).
The following activities are considered cheating and are not allowed in DSC 80 (not an exhaustive list):
- Using or submitting code acquired from other students (except from your partner during projects), the web, or any other resource not officially sanctioned by this course
- Posting your code online, including on Campuswire, unless privately to instructors only
- Having any other person complete any part of your assignment on your behalf
- Completing an assignment on behalf of someone else
- Providing code, exam questions, or solutions to any other student in the course
- Splitting up project questions with your partner and each working on different questions
- Collaborating with others on exams
The following activities are examples of appropriate collaboration and are allowed in DSC 80 (not an exhaustive list):
- Discussing the general approach to solving labs or projects
- Talking about problem-solving strategies or issues you ran into and how you solved them
- Discussing the answers to exams with other students who have already taken the exam after the exam is complete
- Using code provided in class, by the textbook or any other assigned reading or video, with attribution
- Google searching for documentation on Python or
pandas - Working together with other students on assignments without copying or sharing answers
- Posting a question about your approach to a problem on Campuswire, without sharing your code
The best way to avoid problems is by using your best judgement and remembering to act with Honesty, Trust, Fairness, Respect, Responsibility, and Courage. Here are some suggestions for completing your work:
- Donât look at or discuss the details of another studentâs code for an assignment you are working on, and donât let another student look at your code.
- Donât start with someone elseâs code and make changes to it, or in any way share code with other students.
- If you are talking to another student about an assignment, donât take notes, and wait an hour afterward before you write any code.
Use of Generative Artificial Intelligence
Generative Artificial Intelligence (GenAI) describes tools, such as ChatGPT and GitHub Copilot, that are trained to generate responses to user-defined prompts, or questions. The existence of such tools is a major milestone in machine learning, and an impressive application of data science in the real world.
Our course policy on the use of GenAI tools for coursework is simple: you may use these tools to build an understanding of course material and to assist you on assignments, keeping in mind that no tool is a substitute for a strong understanding of course concepts.
Be mindful of how you are using GenAI tools. These tools can be very useful to help you preview material before lecture, summarize material after lecture, explain concepts you didnât understand, and explore how different concepts are related. âExplain it like Iâm fiveâ can be a helpful prompt to give you a basic understanding of new concepts before being exposed to them in lecture. Consolidating your knowledge after learning something new and relating it to other things you know is important for learning and retention.
Unfortunately, GenAI tools are not a consistently reliable source of quality information. Because of how GenAI tools are trained, they often provide answers and write code that look correct, but arenât actually correct. A goal of your education is to develop an ability to identify and produce information that actually is correct and doesnât just sound correct. Human supervision of GenAI tools is always necessary.
Proceed with caution when using tools to assist you with your assignments. DSC 80 is a foundational class for your study of data science; you need to master the skills and concepts of this course if you want to use data science effectively. Through exams, you will be tested on your independent ability to apply course material to novel problems. Labs and projects are meant to prepare you for these assessments, so overreliance on GenAI for assignments will rob you of opportunities to learn and make it hard for you to perform well on exams.
If you do use GenAI to assist you on assignments, keep these guidelines in mind:
- Design your prompts carefully. Donât just ask one question; ask a follow-up question based on the output to the first. To use these tools effectively, you need to engineer your prompts carefully.
- Test the outputs. GenAI tools can and do make mistakes, and being able to verify the correctness of a proposed answer is an important skill for you to develop. Validate the output against course-provided references, or follow up with a search on Google or Stack Overflow. Remember that GenAI tools provide crowdsourced likely answers, not necessarily correct answers.
- Donât submit any code that you donât understand, or that uses content not taught in this class. In our experience last quarter, students who used ChatGPT to help with assignments ended up with code that was difficult for both them and the teaching staff to understand. If you answer questions with out-of-scope content, you are not practicing the foundational skills that the course is meant to teach you. Be careful!
If your assignment submission includes any content generated by an AI tool, it should be cited to acknowledge the source of the material. You can include citations as comments in your code.
Support đ«
Accommodations
From the Office for Students with Disabilities (OSD):
OSD works with students with documented disabilities to review documentation and determine reasonable accommodations. Disabilities can occur in these areas: psychological, psychiatric, learning, attention, chronic health, physical, vision, hearing, and acquired brain injuries, and may occur at any time during a studentâs college career. We encourage you to contact the OSD as soon as you become aware of a condition that is disabling so that we can work with you.
If you already have accommodations via OSD, please make sure that we receive your Authorization for Accommodation (AFA) letter by the end of Week 1 so that we can make arrangements for accommodations. Share your AFA letter with the instructor and the Data Science OSD Liaison, who can be reached at dscstudent@ucsd.edu.
Diversity and Inclusion
We are committed to an inclusive learning environment that respects our diversity of perspectives, experiences, and identities. Our goal is to create a diverse and inclusive learning environment where all students feel comfortable and can thrive. If you have any suggestions as to how we could create a more inclusive setting, please let us know. We also expect that you, as a student in this course, will honor and respect your classmates, abiding by the UCSD Principles of Community. Please understand that othersâ backgrounds, perspectives and experiences may be different than your own, and help us to build an environment where everyone is respected and feels comfortable.
Campus Resources
There are several campus resources available to you, including:
- UCSD Counseling and Psychological Services (CAPS)
- Hub Basic Needs Center
- Office for Students with Disability (OSD)
- Office for Prevention of Sexual Harassment and Discrimination
More generally, if you have any concerns about your ability to focus or succeed in this course, or just need someone to talk to, please contact us ASAP and weâll figure something out.
Waitlist
If youâre on the waitlist, make sure you participate in the class just as if you were enrolled so that if you do get in, youâre not behind. Submit all assignments on time, as you wonât be allowed to go back and do past-due assignments for credit if you join late.
Often, people will ask about their chances of making it off the waitlist. Unfortunately, that can be hard to answer! In some quarters, the waitlist moves a lot; in others, not at all.
Acknowledgements đ
This offering of DSC 80 builds off of prior offerings by Aaron Fraenkel, Sam Lau, Suraj Rampure, Justin Eldridge, Marina Langlois, Tauhidur Rahman, and Duncan Watson-Parris. Along with the help of their tutors and TAs, they developed much of the content that we will use in this course.