Rasesh Udayakumar Shetty
Mathematics and Computing Undergraduate | AI/ML Enthusiast | Competitive Programmer | Web Developer
About Me
I'm a passionate Mathematics and Computing undergraduate at the Indian Institute of Technology (BHU), Varanasi. With a strong foundation in Data Structures, Algorithms, and Mathematics, I'm deeply interested in AI/ML and its applications. I love solving complex problems and building innovative solutions in the field of computer science and artificial intelligence.
Skills
Projects
Developed a novel architecture using VAEs for detecting AI-generated images with 99% accuracy on similar latent space and 75% on unseen datasets. Finetuned Qwen2-VL model for reasoning.
Technologies: PyTorch, PyTorch Lightning, Gemini API
Designed and trained a custom architecture for GIF Visual Question Answering using PyTorch, incorporating BLIP-2's qformer and Llama3.2-1b.
Technologies: PyTorch, BLIP-2, Llama3.2-1b
Implemented a UNet3+ model for segmenting retinal blood vessels using the CHASE_DB1 dataset. Employed advanced preprocessing techniques, including green channel extraction and morphological transformations, to enhance vessel visibility and suppress background noise. Achieved improved segmentation accuracy and generalization across biomedical images.
Technologies: Python, PyTorch, OpenCV, NumPy
Developed a Generative Adversarial Network (GAN) to generate realistic anime-style faces. The model was trained on a dataset of anime images, utilizing deep learning techniques to capture the unique features and styles characteristic of anime art. The project demonstrates the application of GANs in creative domains, highlighting the potential of AI in art and design.
Technologies: Python, TensorFlow, Keras, NumPy, Matplotlib
Trained Neural Networks to solve Ordinary Differential Equations using PyTorch, gaining insights into function learning using Neural Networks.
Technologies: PyTorch, Neural Networks
Education
Indian Institute of Technology (BHU), Varanasi
Integrated Dual Degree in Mathematics and Computing
August 2023 - Present
CPI: 9.43/10
Relevant Coursework: Data Structures and Algorithm, Computer System Organisation, Discrete Mathematics, Real Analysis, Linear Algebra
Publications
Collecting and curating a large-scale dataset of 65M images and 200M question-answer pairs in the standard museum catalog format for exhibits from all around the world; training large vision-language models on the collected dataset, and benchmarking their ability on five visual question answering tasks.
Contact Me
Get in Touch
Feel free to reach out to me for any inquiries, collaborations, or just to say hello!