My Projects

Here you'll find a collection of projects and experiments I've worked on, ranging from web development and open-source tools to personal learning explorations.

Research Work

  • Object Hallucinations in Multimodal Large Language Models: A Survey [github]
    Multimodal Large Language Models (MLLMs) have shown remarkable advancements in tasks that integrate textual and visual data. However, they often generate outputs that are inconsistent with the provided visual content, raising concerns about their reliability. This survey aims to provide insights into the current research landscape regarding hallucinations in MLLMs, integrating existing knowledge and identifying future research directions.
  • Relation Extraction using LLMs [github]
    A survey of relation extraction methods, focusing on their evolution, challenges, and future directions. It covers traditional methods, deep learning approaches, and the latest advancements in the field.

Serious Projects

  • HackCast [Demo] [Source]
    A full stack web app that will shows you hackernews but in audio format!
  • Minima [Demo] [Source]
    A text editor I made using react js

Fun Projects

  • Poe.try [Demo] [Source]
    Poe.try is a minimalist poetry reading application designed to enhance your poetry consumption experience. The app provides a clean, distraction-free interface for reading poems with elegant typography and smooth animations. The design of the app was inspired by apps like tiktok and instagram reels which recently gave rise to the issue of "doomscrolling", which I wish never existed.
  • English to Brainrot Translator [Demo]
    Converts brainrot to english and vice versa.
  • Bollywood-GPT [Demo] [Source]
    I fine tuned a GPT-2 model on a corpus of bollywood movies so that when I provide it genres, the model is able to generate summaries of new movies and also recommend suitable actors and titles for the project.