CS 7643 Deep Learning
CS 4644 / 7643 Deep Learning
Fall 2024, Tue/Thu 5:00 pm - 6:15 pm, Clough UG Learning Commons 152
Course Information
Schedule
Grading
Late Policy
Prerequisites
FAQs
Projects
External Resources
Course Information
This is an exciting time to be studying (Deep) Machine Learning, or Representation Learning, or for lack of a better term, simply Deep Learning!
Deep Learning is rapidly emerging as one of the most successful and widely applicable set of techniques across a range of domains
(vision, language, speech, reasoning, robotics, AI in general), leading to significant commercial success and exciting new directions that may previously have seemed out of reach.
This course will introduce students to the basics of Neural Networks (NNs) and expose them to some cutting-edge research.
It is structured in modules (background, Convolutional NNs, NN Training, Sequence Modeling, Generative Modeling, Frontiers).
Modules will be presented via instructor lectures and reinforced with homeworks that teach theoretical and practical
aspects. The course will also include a project which will allow students to explore an area of Deep Learning that interests them in more depth.
Instructor
Danfei Xu (danfei@)
Teaching Assistants
Head TA: Wei Zhou (wzhou322@)
Zachary Breitbart (zacharyb@)
Srikar S Balusu (sbalusu30@)
David He (davidhe@)
Haotian Xue (hxue45@)
Woo Chul Shin (wshin49@)
Huaijin(Tony) Tu (htu35@)
Aryan Sarswat (asarswat8@)
Elias Cho (echo94@)
Rodrigo Loza (rloza3@)
Vaibhav Saxena (vsaxena33@)
Class Info & Links
Lectures: T/Th: 5:00 pm - 6:15 pm, Clough UG Learning Commons 152
Website:
Piazza:
Canvas
CS4644:
CS7643:
Gradescope
CS4644:
CS7643:
Tentative Schedule (subject to changes)
Date
Topic
Optional Reading
W1: Aug 20
Intro lecture + class logistics.
Slides (pdf)
PS0
is due 11:59pm 08/25 (NO grace period). See Piazza for instruction.
IMPORTANT: All students MUST complete PS0! This is true even if you are currently on the waitlist and want to get in! See FAQ below for gradescope access if you are not registered.
LeCun et al., Nature '15
Shannon, 1956
DL book: Linear Algebra background
DL book: Probability background
DL book: ML Background
W1: Aug 22
Machine learning intro, applications (CV, NLP, etc.), parametric models and their components
Slides (PDF)
W2: Aug 27
Supervised Learning, Linear Classification, Loss functions, Gradient Descent
Slides (pdf)
PS0 due 08/26 11:59pm EST.
DL book: Linear Algebra background
DL book: Probability background
DL book: ML Background
W2: Aug 29
Backpropagation, Computation Graph
PS1/HW1 out, due Sep. 17th 11:59pm
Slides (pdf)
W3: Sep 3
Backpropagation with Neural Networks. Optimization Basics
Slides (pdf)
DL book: Deep Feedforward Nets
Matrix calculus for deep learning
Automatic Differentiation Survey, Baydin et al.
W3: Sep 5
How to Pick a Project
Slides (pdf)
W4: Sep 10
Convolution
Slides (pdf)
DL book: Convolutional Networks
W4: Sep 12
Convolution, Pooling, Convolutional Neural Networks
Slides (pdf)
DL book: Convolutional Networks
W5: Sep 17
Convolutional Neural Networks, Training Neural Networks (part 1): Activation Functions, Data Preprocessing
Slides (pdf)
PS/HW1 due Sep. 17th 11:59pm, PS/HW2 out
DL book: Convolutional Networks
W5: Sep 19
Project Helping Session
W6: Sep 24
Training Neural Networks 2: Weight Initialization, Batch Normalization, Optimization
Slides (pdf)
Project Proposal Due Sep 24th 11:59pm
General preprocessing:
cs231n on preprocessing
General preprocessing:
Preprocessing for deep learning: from covariance matrix to image whitening
DL book: Optimization for Training Deep Models
W6: Sep 26
Training Neural Networks 3: Optimization, Regularization, Hyperparameter Search, Transfer Learning, Model Ensembles
Slides (pdf)
DL book: Regularization for DL
DL book: Optimization for Training Deep Models
W7: Oct 1
Recurrent Neural Networks
Slides (pdf)
W7: Oct 3
Attentions and Transformers
Slides (pdf)
PS/HW2 due Oct 3th 11:59pm, PS/HW3 out
Attention is all you need
BERT Paper
The Illustrated Transformer
Formal Algorithms for Transformers
W8: Oct 8
Deep Learning Software and Hardware
Slides (pdf)
W8: Oct 10
Computer Vision: Detection and Segmentation
Slides (pdf)
Fully Convolutional Networks for Semantic Segmentation
W9: Oct 15
No class: Fall Break
W9: Oct 17
Computer Vision (Continued) and Neural Rendering
Slides (pdf)
End-to-End Object Detection with Transformers
Segment Anything
NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
W10: Oct 22
Generative Models - Autoregressive Models and Variational Autoencoders (VAE)
Slides (pdf)
PS/HW3 due Oct. 22nd 11:59pm, HW4 out
W10: Oct 24
Generative Models - Denoising Diffusion Probablistic Models
Slides (pdf)
W11: Oct 29
GANs and Self-Supervised Learning
Slides (pdf)
W11: Oct 31
3D Vision
Slides (pdf)
W12: Nov 5
Natural Language Processing: Large Language Models (Instructor: Will Held)
(Remote Class)
Slides (pdf)
W12: Nov 7
No Lecture --- instructor attending conference
W13: Nov 12
Reinforcement Learning 1: MDP, Value Iteration, Deep Q Learning.
Slides (pdf)
PS/HW4 due 11/12 11:59pm
Sutton & Bartow Chapter 1
Survey paper on Deep RL
MDP Notes (courtesy Byron Boots)
W13: Nov 14
Reinforcement Learning 2: Actor-Critic, Frontiers.
Slides (pdf)
Sutton & Bartow Chapter 1
Survey paper on Deep RL
MDP Notes (courtesy Byron Boots)
W14: Nov 19
Large Vision Language Models
Slides (pdf)
W14: Nov 21
Guest Lecture: TBD (
Sidd Karamcheti
W15: Nov 26
Robot Learning and Class Conclusion
W15: Nov 28
No Class: Thanksgiving Break
W16: Dec 3
Poster Session (Klaus Atrium): 5-7PM
Grading
64% Homework (4 homeworks)
36% Final Project
1% (potential bonus) Class Participation: top endorsed answers/questions/comments on Piazza
Late policy for deliverables
There will be no make-up work provided for missed assignments. Of course, emergencies (illness, family emergencies) will happen. In those instances, please submit an
Class Absence Verification Form to Dean of Students
office (see
here
for rules). The Dean of Students is equipped to verify emergencies and pass confirmation on to all your classes. For consistency, we ask all students to do this in the event of an emergency. Do not send any personal/medical information to the instructor or TAs; all such information should go through the Dean of Students.
Every homework deliverable and project deliverable will have a 48-hour grace period during which no penalty will apply. This is intended to allow you time to verify that your submission has been submitted (we recommend you re-download it and look it over to make sure all questions/deliverables have been answered). Canvas will show your submission as late, but you do not have to ask for this grace period. Deliverables after the grace period will receive a grade of 0.
Prerequisites
CS 4644/7643 should NOT be your first exposure to machine learning. Ideally, you need:
Intro-level Machine Learning
CS 4641 for the undergraduate section and CS 7641/ISYE 6740/CSE 6740 or equivalent for the graduate section.
Algorithms
Dynamic programming, basic data structures, complexity (NP-hardness)
Calculus and Linear Algebra
positive semi-definiteness, multivariate derivates (be prepared for lots and lots of gradients!)
Programming
This is a demanding class in terms of programming skills.
HWs will involve Python and PyTorch.
Your library of choice for project.
Ability to deal with abstract mathematical concepts
Online Student Conduct and (N)etiquette
Communicating appropriately in the online classroom can be challenging. All communication, whether by email, Piazza, Canvas, or otherwise, must be professional and respectful. In order to minimize this challenge, it is important to remember several points of “internet etiquette” that will smooth communication for both students and instructors
Read first, Write later. Read the ENTIRE set of posts/comments on a discussion board before posting your reply, in order to prevent repeating commentary or asking questions that have already been answered.
Avoid language that may come across as strong or offensive. Language can be easily misinterpreted in written electronic communication. Review email and discussion board posts BEFORE submitting. Humor and sarcasm may be easily misinterpreted by your reader(s). Try to be as matter of fact and as professional as possible.
Follow the language rules of the Internet. Do not write using all capital letters, because it will appear as shouting. Also, the use of emoticons can be helpful when used to convey nonverbal feelings. ☺
Consider the privacy of others. Ask permission prior to giving out a classmate’s email address or other information.
Keep attachments small. If it is necessary to send pictures, change the size to an acceptable 250kb or less (one free, web-based tool to try is picresize.com).
No inappropriate material. Do not forward virus warnings, chain letters, jokes, etc. to classmates or instructors. The sharing of pornographic material is forbidden.
NOTE: The instructor reserves the right to remove posts that are not collegial in nature and/or do not meet the Online Student Conduct and Etiquette guidelines listed above.
Plagiarism & Academic Integrity
Georgia Tech aims to cultivate a community based on trust, academic integrity, and honor. Students are expected to act according to the highest ethical standards. All students enrolled at Georgia Tech, and all its campuses, are to perform their academic work according to standards set by faculty members, departments, schools and colleges of the university; and cheating and plagiarism constitute fraudulent misrepresentation for which no credit can be given and for which appropriate sanctions are warranted and will be applied. For information on Georgia Tech’s Academic Honor Code, please visit
or
You are encouraged to discuss problems and papers with others as long as this does not involve the copying of code or solutions. After discussions, all materials that are part of a submission should be wholly your own. Do NOT search for code directly implementing the assignment and submit snippets or variations of them. You can search for conceptual information but NOT code solutions. Any public material that you use (open-source software, help from a textbook, or substantial help from a friend, etc.) should be acknowledged explicitly in anything you submit to us. If you have any doubts about whether something is legal or not, please do check with the class Instructor or the TA.
We will actively check for cheating, and any act of dishonesty will result in a Fail grade. Any student suspected of cheating or plagiarizing on any deliverable including assignments will be reported to the Office of Student Integrity, who will investigate the incident and identify the appropriate penalty for violations.
Students with Disabilities
If you are a student with learning needs that require special accommodation, contact the Office of Disability Services at 404.894.2563 or
, as soon as possible, to make an appointment to discuss your special needs and to obtain an accommodations letter. Please also e-mail me as soon as possible in order to set up a time to discuss your learning needs.
Subject to Change Statement
The syllabus and course schedule may be subject to change. Changes will be communicated via the Canvas announcement tool. It is the responsibility of students to check Piazza, email messages, and course announcements to stay current in their online courses.
Campus Resources
Community Resources
Project Details (36% of course grade)
The class project is meant for students to (1) gain experience implementing
deep models and (2) try Deep Learning on problems that interest them. The
amount of effort should be at least the level of 1.5 homework assignment per
group member (2-4 people per group). The deliverables are
- Project Proposal (1%): Due Sep 24
- Milestone Written Report (10%): Due Nov 4
- Final Written Report (20%): Due Dec 2
- Poster Session (5%): Dec 3, Klaus Atrium
The final report is a PDF write-up describing the project in a self-contained manner will be the sole
deliverable. Your final write-up is required to be between 6 - 8 pages using a standard Computer Science conference
paper template such as CVPR and NeurIPS (we will release the LaTeX template).
Please use this template so we can fairly judge all student
projects without worrying about altered font sizes, margins, etc. After the class, we
will post all the final reports online so that you can read about each others’ work.
Additionally, we will allow people to upload additional code, videos, and other supplementary material
as zip file similar to code upload for assignments.
While the PDF may link to supplementary material, external documents, and code,
such resources may or may not be used to evaluate
the project. The final PDF should completely address all of the points in the
rubric described below.
Rubric
We will release a detailed project rubric and the poster session format on Piazza soon.
FAQs
The class is full. Can I still get in?
Sorry. The course admins in CoC control this process. Please talk to them.
Unregistered Students who intend to register:
If you are not registered for this course, you will not have access to Gradescope for submission of PS0. Just use PS0 as a self-assessment until you manage to enroll in the course.
Students who individually emailed us and have not been added yet - you may have left out the details of which course instance you are planning to take (either CS 7643 or CS 4644). There are two separate Gradescope courses for the two instances. Please fill the above form in order to provide us with this information.
Registered students who are not able to access Gradescope:
This will happen if you were registered to the course very recently. Gradescope rosters are synced periodically and it may take some time for you to receive a Gradescope sign-up notification. If you still face problems with accessing Gradescope, please email us.
I am graduating this Fall and I need this class to complete my degree requirements. What should I do?
Talk to the advisor or graduate coordinator for your academic program. They are keeping track of your degree requirements and will work with you if you need a specific course.
Can I audit this class or take it pass/fail?
No. Due to the large demand for this class, we will not be allowing audits or pass/fail. Letter grades only. This is to make sure students who want to take the class for credit can.
I have a question. What is the best way to reach the course staff?
Registered students – your first point of contact is Piazza (so that other students may benefit from your questions and our answers).
If you have a personal matter, create a private piazza post.
Related Classes / Online Resources
CS231n Convolutional Neural Networks for Visual Recognition, Stanford
Machine Learning, Oxford
Deep Learning, New York University
Deep Learning, CMU
Deep Learning, University of Maryland
Hugo Larochelle’s Neural Networks class
Book
Deep Learning, Ian Goodfellow, Aaron Courville, and Yoshua Bengio, MIT Press
Overviews
Deep Learning, Yann LeCun, Yoshua Bengio, and Geoffrey Hinton, Nature
Representation Learning: A Review and New Perspectives, Yoshua Bengio, Aaron Courville, and Pascal Vincent
Note to people outside Georgia Tech
Feel free to use the slides and materials available online here.
If you use our slides, an appropriate attribution is requested.
Please email the instructor with any corrections or improvements.