Biography

Hey there!

I recently wrapped up my BTech in Computer Science and a Masters by Research in Computational Linguistics from IIIT, Hyderabad. For my thesis, I worked on Narrative Understanding with Dr. Manish Shrivastava. I was also an RA under Dr. Kamal Karlepalem and Dr. Lini Thomas, working on policy complaince in the Securties and Exchanges sector.

Professionally, I’ve had stints as a Data Scientist at Klevu, playing around with improving search by utilizing user data. Now, I’m pushing boundaries at LimeChat in Bengaluru as an SDE (ML), developing next-gen chatbots.

When I’m not buried in code or research, you’ll find me cheering for my favorite football team (#GGMU), watching people go in circles very fast (on both two wheels and four ) or lost in a book. Curious about what I’ve been reading lately? Check out my reading list here

Interests

  • Narrative Understanding
  • Document / Discourse NLP
  • Factuality and Interpretabiltiy in LLMs

Education

  • Masters by Research in Computational Linguistics, 2023

    IIIT, Hyderabad

  • BTech in Computer Science, 2023

    IIIT, Hyderabad

Interests

Emacs

Org Mode ❤

Heavy Metal

Rammstein ❤

Sci-Fi

Doctor Who ❤

Experience

 
 
 
 
 

SDE ML

Limechat

Oct 2022 – Present Bengaluru

• Achieved 25% improvement in entity extraction and search, leading to increasing conversion by more than 25% and reducing bot confusion by as much as 10%

• Designed, implemented, and maintained automated content generation systems, incorporating QA and style transfer using LLM pipelines. This streamlined onboarding for new clients by 87%.

• Developing a Hybrid (Intent + Intentless) chatbot by integrating LLMs like GPT3.5/4 with Rasa chatbots, resulting in a contextual and personalized chat experience for users. Demonstrated a reduction in confusion by over 50%.

 
 
 
 
 

Research Associate

Data Sciences and Applications Centre

Jun 2020 – May 2022 Hyderabad

• Worked on policy compliance in the Indian securities and exchanges sector, leading to 2 publications at WWW’22.

• Achieved over 97% accuracy in automated violation predictions by collaborating with legal professionals and fine-tuning a transformer model, posing the task as a multi label classification problem.

• Funded by JP Morgan Faculty Awards, 2021 & 2022

 
 
 
 
 

Data Scientist

Klevu Oy

Feb 2020 – Jun 2022 Remote

Catalog Enrichment: Worked on developing an end-to-end pipeline to improve search queries by enhancing the catalog through customer reviews. Implemented multiple neural Aspect Based Sentiment Analysis papers.

Language Enrichment: Worked on extending language support for 15 new languages. Extended the duckling parser for these languages to extract price information from user queries in real-time. Utilized Open Multilingual Wordnet to support automatic synonym enrichment for multiple languages. Developed a transformer-based system that supports multilingual word sense disambiguation.

Analytics: Built WordGraph, a tool to visualize relations between search queries to help merchants make data-driven decisions, was featured in Forbes.

 
 
 
 
 

Teaching Assistant

IIIT, Hyderabad

Aug 2019 – Dec 2019 Hyderabad
Teaching Assistant for the Introduction to Linguistics 2 course taught by Dr. Aditi Mukherjee
 
 
 
 
 

Research Assistant

Language Technology Research Centre

Mar 2019 – Oct 2022 Hyderabad

Persuing Research in NLP under Dr. Manish Shrivastava.

Areas of interest include:

  • Events and Timebanks for Indian Languages
  • Narrative Understanding
 
 
 
 
 

Intern

LogicMatter Inc

Jul 2018 – Dec 2018 Hyderabad
Worked on a platform to perform ad-hoc data analysis of IoT devices deployed across the world. Integrated login credentials, and OLAP queries

Recent Posts

Monty Python's Life of Brian

I review Monty Python’s Life of Brian for the March 2020 edition of Ping!

Natural Language Inference

What is Natural Language Inference? Given two statements or sentences $S_1$ and $S_2$, the task of determining whether a hypothesis is …

Projects

*

HinDisSent

Sentence Representations for Hindi using dependency relations

Dependent Transformers

We explore the effects of the injection of source syntax on Transformer based Machine Translation/

Hierarchical Generalization without Hierarchial Bias

A reproduction study looking into how RNNs perform on tasks that require hierarchial generalisation.

Recent Publications

Detecting Regulation Violations for an Indian Regulatory Body through Multi Label Classification

The Securities and Exchange Board of India (SEBI) is the regulatory body for securities and commodities in India. SEBI creates, and …

SEBI Regulation Biography

A system that performs semantic processing of SEBI documents using language models to produce enriched regulations containing timelines …

MARCUS: An Event-Centric NLP Pipeline that generates Character Arcs from Narratives

Character arcs are important theoretical devices employed in literary studies to understand character journeys, identify tropes across …

Task Adaptive Pretraining of Transformers for Hostility Detection

This work explores the gains attributed to Task Adaptive Pretraining (TAPT) prior to fine-tuning of Transformer-based architectures and …

Leveraging Multilingual Transformers for Hate Speech Detection

This work leverages Transformer language models to identify hate speech in a multilingual setting with a pre-trained multilingual …

Contact