
Vishnu Sathwik | NLP Researcher & AI Enthusiast
Welcome to my digital realm! I'm Vishnu Sathwik, an undergrad at IIIT Hyderabad with a passion for pushing the boundaries of AI and NLP.
My core focus lies in enhancing reasoning capabilities of language models—from tackling complex mathematical problems to enabling smarter, context-aware AI systems. I'm particularly interested in multimodal AI that can understand and reason across different forms of data, including text, images, and other modalities.
When I'm not diving into the depths of AI research, you might find me fine-tuning models, attending NLP workshops, or writing about the latest breakthroughs in the field.
Let's innovate together and shape the future of intelligent systems!
About Me
I am Vishnu Sathwik, currently pursuing B.Tech in Computer Science at IIIT Hyderabad. I am currently a member of the Precog research group at IIIT Hyderabad, working under the mentorship of Dr. Ponnurangam Kumarguru. My research focuses on large and small language models (LLMs and SLMs), with a particular intrest on enhancing their reasoning capabilities and multimodal understanding. I am passionate about pushing the boundaries of AI by developing systems that can think more intelligently and become increasingly context-aware across different modalities.
I have been fortunate to collaborate with experienced researchers during my academic journey. In summer 2024, I interned at the EMA Lab, IIT Dharwad, under the guidance of Dr. Konjengbam Anand and Mr. Rachit Verma, where I worked on NLP projects that expanded my understanding of AI applications. I also had the opportunity to attend the International Advanced Summer School on Natural Language Processing 2024, organized by the Language Technologies Research Center (LTRC) at IIIT Hyderabad. There, I contributed to fine-tuning models for headnote generation in judicial judgments.
As I move forward, my goal is to contribute to the development of AI systems that can reason and think beyond their training, making AI more capable of complex problem-solving across multiple modalities.
Publications
As an undergraduate researcher, I'm currently in the exciting phase of developing my research portfolio. While I don't have formal publications yet, I'm actively working on several projects at the cutting edge of AI and NLP research.
Research Interests
- Reasoning in Language Models: Developing methods to enhance the reasoning capabilities of LLMs and SLMs, particularly for complex tasks requiring multi-step problem-solving
- Multimodal Reasoning: Exploring the intersection of vision and language to build AI systems that can reason across different modalities
- Mathematical Problem-Solving: Investigating how language models approach and solve mathematical challenges, and designing techniques to improve their performance
- Context-Aware AI: Creating systems that better understand and utilize contextual information across diverse scenarios
I'm committed to contributing meaningful research to the AI community. Stay tuned for upcoming publications!
News & Updates
Joined as an undergrad researcher at Precog research group under the mentorship of Dr. Ponnurangam Kumarguru
New blog post: Beautiful Math Behind Principle Component Analysis
New blog post: Behind the Magic: Training Large Language Models
Started as a full-time student at IIIT Hyderabad
Participated in IASNLP Summer School at IIIT Hyderabad
Completed internship at EMA Lab, IIT Dharwad
Secured lateral entry admission to IIIT Hyderabad (B.Tech + M.S)
Delivered talk on Neural Networks at IIIT Kottayam
Get in Touch
Contact Information
Let's Connect
I'm always open to new opportunities and collaborations. Whether you have an opportunity in mind, a question, or just want to say hello, feel free to reach out. I'll get back to you!
Download CVBlogs
Beautiful Math Behind Principle Component Analysis. - Explaining the math behind PCA with examples.
Behind the Magic: Training Large Language Models - A basic understanding of LLM training
What happens inside a simple neural network - Understanding neural network fundamentals
Impact of Artificial Intelligence on human jobs - Analyzing AI's influence on employment