Self-supervised learning (pre-training) has led to the development of artifacts like Large Language Models (LLMs), which have revolutionized fields such as natural language processing (NLP). This course serves as an introduction to self-supervised learning within the context of natural language. Its objective is to provide a comprehensive overview of industry standards for pre-training, aligning, and deploying language models. The course is designed to equip you with a diverse set of skills essential for future success, whether you aim to conduct research using language models or apply them in industrial settings. Python, with the PyTorch framework, will be the primary programming environment used in the class. The course will include instructor-led lectures, seven assignments involving both empirical and theoretical work, in-class quizzes, and a final project.

Note: This course is different from 601.771 (typically offered in fall semesters), which focuses on advanced topics in recent papers and is geared toward graduate students who want to specialize in the latest developments in self-supervised models.

Prerequisites:

Suggested prerequisites: These are not hard prerequisites, but it is highly useful to have a background in "Natural Language Processing" and "Machine Learning". You may gain such background by taking one of the following courses: "Machine Learning" (CS 475/675), "Machine Learning: Deep Learning" (CS 482/682), "Natural Language Processing" (CS 465/665), or "Machine Translation" (CS 468/668).

Relevant Courses at Hopkins: This course has some overlap with "Natural Language Processing" (EN.601/665), and "Artificial Agents" (EN.601.470/670), though the courses have different focuses.

Logistics




Assignments

The homework offers you a chance to practice applying what you've learned. It is intended to complement lectures and office hours, which aim to provide intuition, motivation, and justification for the skills we want you to develop. However, the most effective way to build these skills is by attempting to solve the problems on your own. The process of practicing is far more valuable than simply being provided with the solution.

The course includes 7 weekly assignments leading up to spring break, designed to enhance both your theoretical knowledge and practical abilities. Each assignment consists of written questions and programming components (Python). Assignments will be posted on this website, and submissions should be uploaded to Gradescope.

Here is a tentative list of topics for the assignments:

# Topic
#1 Algebra, calculus, probability, and optimization (gradient descent) recap; understanding the softmax function; loss functions (cross-entropy, MSE, etc.); a machine learning problem (classification, evaluation)
#2 PyTorch introduction, automatic differentiation, computation graph, basic feedforward network and backpropagation
#3 Neural language model with feedforward network, evaluating language models, count-based models, decoding language models
#4 RNN-based LMs; Transformers
#5 Fine-tuning language models, prompting language models, distributed training
#6 Prompt engineering, in-context learning; Retrieval-augmented language models
#7 Alignment with instruction-tuning, alignment with [human] feedback
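As a small taste of the first assignment's topics (the softmax function and cross-entropy loss), here is a minimal pure-Python sketch; this is an illustration only, and in the assignments themselves you would typically rely on PyTorch's built-in implementations.

```python
import math

def softmax(logits):
    # Subtract the max logit for numerical stability before exponentiating.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(logits, target):
    # Negative log-probability of the target class under softmax(logits).
    probs = softmax(logits)
    return -math.log(probs[target])

# The resulting probabilities sum to 1, and the loss is smaller
# when the target class receives the largest logit.
probs = softmax([2.0, 1.0, 0.1])
loss = cross_entropy([2.0, 1.0, 0.1], 0)
```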

In-class Quizzes

There will be 3 in-class quizzes. The quizzes will be paper-based and held during the usual class time. They will assess students' mastery of the topics discussed in the lectures and homework assignments, and they provide feedback to both the student and the instructor by identifying areas that need improvement, informing further learning and teaching. The quizzes will encompass all material from the start of the semester up to the topic specified by the instructor.


Class Project

The objective of the final project is to make use of what you have learned during this course to solve a hard problem.

The final project milestones include: (1) a project proposal, (2) a project midway report, (3) a final report, and (4) a final project poster summarizing the technical aspects of the project. See the course calendar for the due dates.



Content Schedule

The current class schedule is below (subject to change):

Date Topic Content Events Deadlines
#1 - Tue Jan 21 Course introduction:
  • Course overview
  • Plan and expectations
PPTX, PDF HW1 to be released!
#2 - Thu Jan 23 Language modeling:
  • Definitions and history,
  • Counting and n-grams,
  • Measuring LM quality,
  • Language modeling as a learning problem
PPTX, PDF
#3 - Tue Jan 28 Feedforward networks:
  • Definitions
  • Brief history
  • Background (algebra + optimization)
  • Analytical Backprop
PPTX, PDF HW2 to be released! HW1 due
#4 - Thu Jan 30 Feedforward networks:
  • Algebra recap
  • Analytical backprop
  • Backprop in practice
PPTX, PDF
#5 - Tue Feb 4 Feedforward networks:
  • Backprop in practice
  • Practical tips
PPTX, PDF HW3 to be released! HW2 due
#6 - Thu Feb 6 Feeding text to neural networks:
  • Tokenization and subwords
  • Fixed-window MLP LMs
PPTX, PDF
#7 - Tue Feb 11 Recurrent Neural LMs:
  • Introducing RNNs
  • Training RNNs
  • RNNs for natural language and language modeling
  • RNNs: Pros and Cons
  • Sampling from LMs
  • Bonus: Pre-training RNNs
PPTX, PDF HW4 to be released! HW3 due
#8 - Thu Feb 13 Quiz 1 - Topics: everything discussed up to this point.
#9 - Tue Feb 18 Transformer LMs:
  • Self-attention
  • Transformer LMs
  • Positional embeddings
PPTX, PDF HW5 to be released! HW4 due
#10 - Thu Feb 20 Transformer LMs:
  • Efficiency considerations
  • Architectural variants
  • Notable models
PPTX, PDF
#11 - Tue Feb 25 Transformer LMs:
  • Notable models
  • Training tips
PPTX, PDF
#12 - Thu Feb 27 Adapting LMs:
  • Adaptation as fine-tuning
  • Parameter-efficient tuning
PPTX, PDF HW6 to be released! HW5 due
#13 - Tue Mar 4 Adapting LMs:
  • Adaptation as in-context learning
  • ICL: Making sense of it
  • Prompt engineering
  • Multi-step prompting
  • Failures of ICL
PPTX, PDF
#14 - Thu Mar 6 Alignment of LMs:
  • Alignment: definitions
  • Instruction-tuning
PPTX, PDF HW7 to be released! HW6 due
#15 - Tue Mar 11 Introducing final projects:
  • Defining final projects
  • Tips for a successful project
Alignment of LMs:
  • RLHF and variants
PPTX, PDF
PPTX, PDF
#16 - Thu Mar 13 Alignment of LMs:
  • Alignment: failures/open questions
  • Simplifying RLHF
  • Alignment with self-generated instructions
  • Value alignment
PPTX, PDF HW7 due
#17 - Tue Mar 18 No Class - Spring Break
#18 - Thu Mar 20 No Class - Spring Break
#19 - Tue Mar 25 Feeding lots of things to LMs
  • Delving into positional encoding
  • Length generalization
PPTX, PDF
#20 - Thu Mar 27 Quiz 2 - Topics: everything discussed up to this point.
#21 - Tue Apr 1 Feeding lots of things to LMs
  • Retrieval-augmentation
Connecting language to outside world:
  • Connecting vision and language
PPTX, PDF
PPTX, PDF
#22 - Thu Apr 3 Connecting language to outside world:
  • Connecting vision and language
  • Generative vision-language
PPTX, PDF
Apr 4 Project proposals deadline
#23 - Tue Apr 8 Connecting language to outside world:
  • Transformers for Audio/speech
  • LMs for coding
  • LMs and grounded actions
  • Open questions
PPTX, PDF
#24 - Thu Apr 10 Efficiency considerations:
  • Quantization
  • Distillation
  • Distributed training
PPTX, PDF
#25 - Tue Apr 15 Scaling LMs:
  • Thinking about computation cost
  • Optimal scaling
PPTX, PDF
#26 - Thu Apr 17 Scaling LMs:
  • Why didn't we scale earlier?
  • When scale does not help
  • Is scale all you need?
Social concerns about LMs:
  • Bias, fairness and toxic language
PPTX, PDF
PPTX, PDF
Apr 20 Midway reports deadline
#27 - Tue Apr 22 Social concerns about LMs:
  • Hallucination
  • Truthfulness and veracity
  • Legal considerations and fair use
  • Reflections about future, dangers and misuses
PPTX, PDF
#28 - Thu Apr 24 Quiz 3 - Topics: everything discussed in the class.
TBD Final project reports
TBD Final project poster session (Time: TBD)

Reference text

There is no required text, though the following can be useful:


Relevant Resources

Here are several resources available for free:

Besides these resources, we will try our best to satisfy individual needs through discussion.


Code of Conduct

The strength of the university depends on academic and personal integrity. In this course, you must be honest and truthful, abiding by the Computer Science Academic Integrity Policy:

Cheating is wrong. Cheating hurts our community by undermining academic integrity, creating mistrust, and fostering unfair competition. The university will punish cheaters with failure on an assignment, failure in a course, permanent transcript notation, suspension, and/or expulsion. Offenses may be reported to medical, law or other professional or graduate schools when a cheater applies. Violations can include cheating on exams, plagiarism, reuse of assignments without permission, improper use of the Internet and electronic devices, unauthorized collaboration, alteration of graded assignments, forgery and falsification, lying, facilitating academic dishonesty, and unfair competition. Ignorance of these rules is not an excuse.

Academic honesty is required in all work you submit to be graded. Except where the instructor specifies group work, you must solve all homework and programming assignments without the help of others. For example, you must not look at anyone else’s solutions (including program code) to your homework problems. However, you may discuss assignment specifications (not solutions) with others to be sure you understand what is required by the assignment. If your instructor permits using fragments of source code from outside sources, such as your textbook or on-line resources, you must properly cite the source. Not citing it constitutes plagiarism. Similarly, your group projects must list everyone who participated.

In the above paragraph "outside sources" also include content that was produced by an AI assistant like ChatGPT. This follows either by treating the AI assistant as a person for the purposes of this policy (controversial) or acknowledging that the AI assistant was trained directly on people's original work. Thus, while you are not forbidden from using these tools, you should consider the above policy carefully and quote where appropriate. Assignments that are in large part quoted from an AI assistant are very unlikely to be evaluated positively. In addition, if a student's work is substantially identical to another student's work, that will be grounds for an investigation of plagiarism regardless of whether the prose was produced by an AI assistant.

Falsifying program output or results is prohibited. Your instructor is free to override parts of this policy for particular assignments. To protect yourself: (1) Ask the instructor if you are not sure what is permissible. (2) Seek help from the instructor, TA or CAs, as you are always encouraged to do, rather than from other students. (3) Cite any questionable sources of help you may have received.

Report any violations you witness to the instructor. You can find more information about university misconduct policies on the web for undergraduate and graduate students.

Johns Hopkins University is committed to equal opportunity for its faculty, staff, and students. To that end, the university does not discriminate on the basis of sex, gender, marital status, pregnancy, race, color, ethnicity, national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, military status, immigration status or other legally protected characteristic. The University's Discrimination and Harassment Policy and Procedures provides information on how to report or file a complaint of discrimination or harassment based on any of the protected statuses listed in the earlier sentence, and the University’s prompt and equitable response to such complaints.