Kaustubh Dhole


View My GitHub Profile

I’m a researcher at Emory University’s Department of Computer Science working with Prof. Eugene Agichtein. I’m interested in a wide variety of problems which generally fall under Natural Language Processing & Information Retrieval.

I completed my bachelor’s at BITS Pilani, India, after which I worked for 6 years in the domain of Conversational AI at Amelia.ai (IPsoft) in the wonderful cities of Bangalore & New York. I got the opportunity to lead the R&D team of around 10-20 enthusiasts working on diverse NLP topics, viz. intent classification, slot tagging, VerbNet & PropBank parsing, KB-based QA, paraphrasing, semantic parsing, relation extraction, and dialog retrieval, ranking, and generation, among other conversational AI problems. Much of the work also involved managing a team of back-end, front-end, and UX developers who built different modules of the Amelia stack.

In the summers of 2021 to 2023, I worked with the Natural Understanding team at Amazon Alexa in New York and San Jose on multi-task learning for their LLMs and on creating simulators for training LLMs, and with the Search Experience Science team in Seattle ⛰️.


Areas of Interest:
NLG Evaluation, Query Reformulation, Retrieval Augmented Generation, LLM Biases & Stereotypes
Other areas I'm happy to collaborate on or have coffee chats about:
Dialog Systems, Graph Neural Networks, Data Augmentation, Efficient Transformers, Privacy Preserving ML, Bigger Picture of LLMs

Publications: The most up-to-date list can be found on Semantic Scholar and Google Scholar.


Workshops:
Organizer of the Generation, Evaluation & Metrics (GEM) workshops GEM 2021 & GEM 2022. I was also a co-organizer of the wisdom-of-researchers collaboration NL-Augmenter and a key contributor to the LLM benchmark BIG-Bench.
Mentoring/Speaking:
Presented some of my work on RAG evaluation at the Workshop on Task Focussed IR in the Era of Generative AI at Microsoft Research, Redmond
Gave a talk on Retrieval Augmented Generation at the University of Edinburgh in 2024, during my visit to present LLM-based reformulation at ECIR 2024, Scotland
Mentored 5 graduate students on efficient variants of GNNs at the London Geometry & Machine Learning Summer School, 2022
Invited as Speaker & Guest of Honour at VIT's ICAITR 2021, Mumbai. Gave a short talk on "NLP in the Past Decade"
My Bioinformatics article was featured online on Global Medical Discovery [ISSN 1929-8536] as a Key Scientific Article contributing to excellence in biomedical research
Recent Lectures on Retrieval Augmented Generation (May 2024):
Video 1 Video 2 Video 3 Video 4
If you want to get in touch or are interested in collaborating, you can reach me at firstname.lastname@emory.edu (or on LinkedIn or Twitter, where I'm sometimes active). Long ago, I used to maintain a personal blog on WordPress where, on rare occasions, I mostly wrote non-NLP stuff! You can find some of my random writings on Politics and Linguistics, some book reviews, and the occasional account of a backpacking trip! One serious piece of advice: cook this!