CS 598 LAZ

CS 598 LAZ: Cutting-Edge Trends in Deep Learning and Recognition

Instructor: Svetlana Lazebnik (slazebni -at- illinois.edu)
Lectures: T TH 12:30-1:45, 216 Siebel
Instructor office hours: Tuesdays 2-3PM or by appointment, 3308 Siebel

TA: Arun Mallya (amallya2 -at- illinois.edu)
TA office hours: Mondays 2-3PM, Wednesdays 3-4PM, 3340 Siebel

Always check announcements on Piazza for short-notice changes to instructor and TA office hours!

Contents: topic list, requirements, schedule, resources

Important links: lecture videos, Piazza (announcements, discussion board), Compass (submission of project deliverables, grades)

Overview

This is an advanced graduate seminar studying current research literature on trends and topics in deep learning, primarily applied to computer vision and language. Topics include state-of-the-art neural architectures and training techniques, recurrent models, neural generative models (adversarial networks and variational autoencoders), deep reinforcement learning, self-supervised learning, language and image-language models, and applications to audio and robotics. Requirements include a group presentation, a final project, literature reviews, peer grading, and participation (see details below).

Prerequisites: Equivalent of introductory courses to machine learning and computer vision, and working knowledge of standard feedforward convolutional neural networks (this course is not an introduction to deep learning).

Detailed Topic List and Readings

Group presentation (50%)

Students will form groups of up to three and jointly develop a lecture on an assigned topic to be delivered on an assigned date. Each group member must deliver a portion of the presentation. See here for a detailed list of topics with suggested initial reading lists.

Signup: Shortly after the first lecture on Tuesday, January 17th, an email will go out to all the registered students with the link to a signup sheet (the link will also be posted on Piazza). Topic assignment is first come, first served, though in case nobody signs up for some topics, we may ask some students to switch to ensure coverage. Signup deadline for registered students is 9AM on January 18th. If you fail to sign up promptly, you will lose 10% of your presentation grade, and may not receive a presentation spot at all, making it impossible to pass the course.

After the deadline for registered students, any unregistered students interested in taking the course are free to sign up for the remaining spots. The presentation schedule will be finalized by the end of Wednesday, January 18th.

Keep in mind that the act of signing up for a topic is a commitment to your teammates, the instructor, and all the other students in the class. Therefore, if you are unsure whether you will stay in the course, we urge you to make this decision now if at all possible.

Outline and reading list (10% of presentation grade): By the end of Sunday the week before you are scheduled to present (i.e., either 10 or 12 days before your presentation), one member of the group must email Lana a brief outline and list of references you plan to cover. The list should be in HTML-friendly format consistent with the one on the website. These will be posted on the class webpage to enable other students to do the readings and prepare for your lecture.This is a real deadline -- failure to submit the outline on time will forfeit this portion of the grade for your team and may negatively affect the ability to schedule a practice presentation.
Practice presentation (20% of presentation grade): After you submit your outline, Lana will schedule a time for your practice presentation approximately a week before your presentation date. The goal is to enable feedback to ensure the highest possible quality of the in-class presentation. All group members must attend. The practice presentation is not expected to be polished or 100% complete, but the grading will be based primarily on evidence that the group is taking the preparation seriously. Even before the "official" practice stage, you are highly encouraged to consult with Lana or Arun during office hours about your reading list or draft slides.
Slides (30% of presentation grade): By the end of the day before the scheduled presentation, the group must email Lana the slides in PowerPoint or PDF format, to be made available to all the students on the course webpage. Be sure to put your names on the title slide and an extended reading list on the final slide. Late submission of the slides will forfeit this portion of the grade. Please see the note on credit attribution below, as failure to follow those guidelines will negatively affect this portion of your grade and may even be considered an academic integrity violation.
In-class presentation (30% of presentation grade): The presentations will be graded based on clarity, technical depth, successful synthesis of content from multiple papers, ability to involve the students, and responsiveness to feedback from the practice presentation. Presentations will be recorded and made available to other students registered for the class.
Peer grading (10% of presentation grade): Each presentation will be scored by six students from the class and their scores will be averaged to form this portion of the grade. The peer grading form is here. For details, see below.

Barring truly exceptional circumstances, all group members will receive the same score for all of the above components (except signup).

Guidelines for creating a successful presentation:

The initial reading lists are not meant to be binding. You should feel free to modify them based on your interest and judgment. Your goal is to identify a thematically coherent group of significant papers that will make for a presentation of high educational value. You are not expected to cover every part of every paper, and your presentation typically should not be organized as a sequence of single-paper summaries. Make sure your list does not contain either too few or too many papers.
Think of yourselves as professors for a day. You should strive to give a comprehensive and understandable lecture introducing the class to a specific topic. It is important to find the right level of technical depth for the audience, to expose the core technical ideas without going into excessive or overwhelming detail. Some presentations will be more like tutorials and some more like surveys, but most should try to identify at least one technical "nugget" that can be taught in reasonable depth to the class.
When preparing your slides, and especially when merging slides provided by different group members, pay particular attention to integration. Make sure that slides covering different topics or papers have a consistent organization, level of detail, and notation. Think about the overall "story" you will tell as you are going from one paper to the next.
Be sure to place your topic in the context of the entire course. Whenever appropriate, take care to point out any specific connections to other presentations that came before.
Where appropriate, feel free to bring a critical perspective to your topic. Go beyond simply describing the techniques. Compare and contrast different papers, question assumptions, expose possible flaws and limitations, suggest alternatives and/or directions for future research. Keep in mind that some of the papers you are covering may not be peer reviewed yet, making skepticism about any extraordinary/unsubstantiated claims warranted (and exercise caution about including any such papers in the first place).
If at all possible or appropriate, you are highly encouraged to include a demo component. You can use code or demos from the Web or implement your own (this could even coincide with your project if your presentation comes later in the semester).
Be sure to involve the class. When you are developing your presentation, identify places where you can ask other students for input, or topics that you want to open up for discussion.
Because timing is hard to predict, you need to maintain some flexibility in terms of the topics you will cover. It is a good idea to have one or two sections in the latter half of your slides that you can skip depending on the time. When you are presenting, keep an eye on the time and adjust the pacing towards the end accordingly.
Use of external sources and credit attribution: Be sure to explicitly give credit whenever you use material from other sources. If you "borrow" any slides or graphics, be sure to give the original source in small font on the bottom of each slide. If you show a demo based on somebody's code, be sure to clearly announce this. Failure to follow these guidelines will hurt your score for the slides, and may even be considered an academic integrity violation. It is not acceptable to use an entire slide deck from another source "as is" as the basis for your presentation.

Group project (30%)

You are encouraged to work on the project with your presentation group, but feel free to form a different group if you want (groups should not be larger than three). The project may take the following forms:

Implementation or demo: Implement one or more methods related to the topics covered in class. Apply existing methods to new datasets. Compare and contrast several methods, adapt or modify them. If feasible, create a demo or show some results in your lecture or on the last day of class.

Paper: Write a survey or tutorial paper on the topic of your lecture (or a different topic if you insist). Models for this kind of paper include Carl Doersch's tutorial on variational autoencoders and Ian Goodfellow's tutorial on generative adversarial networks. If the topic you have chosen already has a good recent tutorial like the two above, this would probably not be the best choice (unless you feel you can write a significantly different tutorial that can offer independent value). The paper should be at least 10 pages in length (single-spaced, single column, 11pt font, 1 inch margins) and typeset in LaTeX.
Video: Create a compelling MOOC-style short video (or several videos) on some concept related to the class -- you can see here for inspiration. If you choose to go this route, the bar for production values is reasonably high.

Project deliverables (submissions on Compass):

Proposal (10% of project grade, due Monday, February 27th): Proposal should be uploaded to Compass in PDF format by one group member and should include: (1) names of group members; (2) a description of the proposed project in a half a page or so; (3) key references, including links to any resources you plan to use (especially code and data). Late submissions will receive no credit, but still need to be turned in in order to avoid further penalty on subsequent components of the project grade.
Progress update (10% of project grade, due Monday, April 3rd): A summary of your current efforts, with notes on any modifications to your original project goals. If you are writing a tutorial or survey paper, you should provide a rough draft of at least three pages. If you are doing an implementation project, at the very least, you should show evidence of successfully running baseline code (e.g., training an off-the-shelf model) on your target data.
Final deliverable (80% of project grade, due Monday, May 8th): Either an implementation report with results, the completed paper, or the video(s). Selected project teams will do a short presentation or demo on the last day of class.

Format for implmentation report: The final report should be submitted in PDF format by one designated group member on Compass. It should be (the equivalent of) at least six pages (single-spaced, 11 point font, 1 inch margins) and mimic the style of a research paper. It is not necessary to submit code. Here is the outline to follow for the report:

Cover page: executive summary: List title and authors. Briefly summarize your problem, line of attack, and most interesting/surprising findings. Be sure to include at least one diagram or example result figure. This is not counted in the six-page minimum.
Introduction: Define and motivate the problem, discuss background material or related work, and briefly summarize your approach.
Details of the approach: Include any formulas, pseudocode, diagrams -- anything that is necessary to clearly explain your system and what you have done. If possible, illustrate the intermediate stages of your approach with results images.
Results: Clearly describe your experimental protocols and identify any external code and datasets used. Present your quantitative evalution (if any) and show some example outputs. If you are working with videos, put example output on YouTube or some other external repository and include links in your report.
Discussion and conclusions: Summarize the main insights drawn from your analysis and experiments. You can get a good project grade with mostly negative results, as long as you show evidence of extensive exploration, thoughtfully analyze the causes of your negative results, and discuss potential solutions.
Statement of individual contribution: Required if there is more than one group member.
References: including URLs for any external code or data used.

Peer grading reports (10%)

Each student will be assigned to grade two presentations and will have to turn in two peer grading reports (DOC, PDF) in the course of the semester. Each peer grading report is worth 5% of your total course grade, so please take it seriously. These reports serve two purposes: to provide constructive feedback to your fellow students, and to encourage you to engage in depth in topics other than your own. Reports will be anonymous to the other students, but not to the instructor. The scores in the reports will be used to calculate the peer portion of the presentation grade for the respective team, and with rare exceptions, they will be shared with the team (but not with the class more broadly).

Reports should be submitted by email to Lana. In the subject of the email, put "mm/dd peer grading report" (where mm/dd is the date of the lecture you are reviewing). For Tuesday presentations, the reports are due by the end of Friday of the same week, and for Thursday presentations, they are due by the end of the following Sunday. Late reports will be penalized 20% (or 1% of your total course grade) for each day they are late.

Participation (10%)

You are expected to come to class most days and participate in discussions both during class and on the Piazza discussion board (a thread for questions and comments will be created for each lecture).

Schedule (in progress)

Presentation teams
Presenters: Reading lists and outlines are due by the end of Sunday before the week you are scheduled to present, followed by a practice presentation. Finalized slides are due the night before your presentation. Come to class at least five minutes early to make sure that your laptop works with the projector.
Peer graders: Reports for Tuesday presentations are due by email to Lana by the end of Friday, and reports for Thursday presentations are due by the end of the following Sunday.

Date	Slides	Reading list
January 17	Class intro	N/A
January 19	CNN architectures (Lana): PPT, PDF	Reading list
January 24	RNN Tutorial (Arun): PPT, PDF	Reading list
January 26	RNN Tutorial Part 2 (Arun): PPT, PDF	Reading list
January 31	Advanced CNN architectures (Akshay, Hong): PPT, PDF	Reading list
February 2	Advanced training techniques (Prajit): PPT, PDF	Reading list
February 7	Network compression, speedup (Shuochao, Yiwen, Daniel): PPT, PDF	Reading list
February 9	Object detection (Jiajun, Sihao, Kevin): PPT, PDF	Reading list
February 14	Semantic segmentation, dense labeling (Liwei): PPT, PDF	Reading list
February 16	Similarity learning (Moitreya, Yunan): PPT, PDF	Reading list
February 21	Visualization, adversarial examples (Ralf, Jyoti, Jiahui): PPT, PDF	Reading list
February 23	Generative adversarial networks (Shashank, Bhargav, Binglin): PPT, PDF	Reading list
February 28	Variational autoencoders (Raymond, Junting, Teck-Yian): PDF	Reading list
March 2	Advanced generation methods (Ameya, Hsiao-Ching, Anand): PPT, PDF	Reading list
March 7	3D + graphics (Juho, Qi): PPT, PDF	Reading list
March 9	Self-supervised learning (Nate, Christian, Pratik): PPT, PDF	Reading list
March 10	Intro to reinforcement learning -- bonus lecture (Unnat, Garima, Karan): PDF 10-11:30AM, SC 216	Reading list
March 14	Deep Q learning (Unnat, Garima, Karan): PPT, PDF	Reading list
March 16	Deep reinforcement learning: policy gradients, planning (Tanmay, Raj, Zhizhong): PDF	Reading list
March 28	Deep learning for manipulation, navigation (Tanmay, Andrey): PPT, PDF	Reading list
March 30	Recurrent architectures (Abhishek, Anusri): PPT, PDF	Reading list
April 4	Image captioning (Anjali, Liaonan, Ruihan): PDF	Reading list
April 6	Image-text embeddings, grounding (Qing, Yang): PDF	Reading list
April 11	Visual question answering (Liang-Wei, Shuai): PDF	Reading list
April 13	Deep learning for NLP (Zeqiu, Quan, Dongming): PDF	Reading list
April 18	Deep learning for machine translation (Yiren, Shaoshi, Yisi): PPT	Reading list
April 20	Deep learning for audio (Matt, Yuchen, Chris): PPT, PDF	Reading list
April 25	Architectures with memory (Nitish, Shreya): PPT, PDF	Reading list
April 27	Meta-algorithms (Mariya, Safa): PDF	Reading list
May 2	Project presentations: PDF

Useful Resources

Books

Courses

Guides to deep learning

Tutorials, blogs, demos

Convolution arithmetic tutorial: web, arXiv
TensorFlow Playground
http://distill.pub/
http://karpathy.github.io/
Arun's dilation demo
Arun's explanation of LSTM Forward-Backward passes
Mixture Density Network tensorflow