Seasons of Code

Stable Diffusion

Machine Learning

Mentors :

Shubham Hazra
Om Godage (21d100006)
Kartik Gokhale (200100083)

Mentees :

Elaborate upon the work and learning involved in the project. Suggest some reading material and resources for mentees to gain context and spark their interest. You may also link GitHub repos/Demonstrations, if the project is an already existing one. Stable Diffusion is a powerful text-to-image AI system, can create photos in the style of cartoonists, 19th century daguerreotypists, stop-motion animators and more. Text to image diffusion models are an exciting area of research in machine learning that aims to generate high-quality images from textual descriptions. This technology has a wide range of applications, such as generating images for artistic or commercial purposes, enhancing accessibility for visually impaired individuals, and aiding in virtual reality and game development. They use a generative approach that involves learning the statistical relationships between text descriptions and corresponding images. This involves training a machine learning model on a large dataset of paired text and image data, which it uses to generate new images based on textual descriptions that it has not seen before. Students can checkout DALL.E-2 by openAI: https://openai.com/product/dall-e-2
https://www.unrealengine.com/en-US/unreal-engine-5 https://docs.unrealengine.com/5.0/en-US/
It's not that hard!! This guy made an open world game in 24 hours. https://www.youtube.com/watch?v=3DjY1T42b_M
PreReqs:
Basic Python and willingness to learn

Tentative Timeline :

Week Number	Tasks to be Completed
Week 1	Basics of Regression & Classical ML
Week 2	Intro to Deep learning & frameworks (Tensorflow, PyTorch)
Week 3	Image Processing using OpenCV & classical methods
Week 4	Dive into CNNs & transfer learning
Week 5	Intro to NLP , text encoders and decoders
Week 6	Learning about GANs, Autoencoders and Attention models
Week 7	Starting with Stable Diffusion
Week 8	Finishing up with the training and implementing the pipeline
Week 9	Finishing up with documentation and submission

Topic: All ML DEVELOPMENT BLOCKCHAIN CP MISCELLANEOUS

TEXT SUMMARIZATION WEB APP

Competitive Programming

Write yourself a Git!

File Compression System

FAST-G

Developing Trading Strategy with Pine Script

Real time Driver Drowsiness detection System

The Image Cartoonifier

Speech to Speech Translation

Competitive Programming - Newbie to Master

Path-Planning of Swarm Robotics in 2/3D space

Image Super Resolution using Deep Neural Networks

Enhance Low Resolution Image using GANs

MyBox

Deep Carlsen

InstiExchange - A web marketplace for IITB

Homomorphic Encryption for k-NN on the Cloud

TRayCer

Social media website with MERN

Dive into Digital Image Processing

Neural Quest

To the Quantum Future

Street Fighter II - Reinforcement Learning

Combinatorial Computing

Navigating the Waters of AI

Autonomous Driving Vehicle

Author Identification through Stylometric Analysis

Breakout Genius - Using RL to Build an AI Game Master

Image Captioning

Cricbuzz

Competitive Programming (CP)

Economics meets Machine Learning

SynerG Lab, CSE Department - Webpage

Computer vision for driverless vehicles

AudioHive A Social Audio App

Learning the Latent structure in LLMs

EdConnect

NFTs Where Art and Tech Converge

Dive into the World of Quant

COLLIDE

Image Processing and Object Detection

Unreal IITB

Human Pose Estimation

Find me out

Sign Language Recogniser

E-Commerce Website for VibSpecLab, IIT Bombay

Image Caption Generator

Intro Full stack web development:Restaurant website

App for Credit rating of Retailers in Clothing Industry

Ray Tracing

YOLO-Cam-Object Detection based Analytics

Physically Based Rendering

A Secure Erasure Code-Based Cloud Storage System with Secure Data Forwarding

Graph Machine Learning

“The Watchdogs” - Solving a murder mystery using Computer Vision and Data Science

ArgueAI

Comic GPT

InstiNav

Face detection for attendance using AI

Hands on Reinforcement

RegExamaton

Institute OnChain Voting System with ZKPs

Using Deep RL and NLP to allocate stocks in portfolio

Blockchain Development- It's not that difficult!

Light field imaging and Dual Attention Networks

Stable Diffusion

FlappeRL

Hands-on Computational Physics

Image Colorization

PaperPal

JobFinderX