i_am_a_fox

I am Harsh Raj, an Applied Scientist at VijilAI, where my focus is making AI agents trustworthy.

Broadly, I am interested in making language models safe, useful, and controllable. Over the past few years, I have taken my first baby steps as a researcher, owing to some wonderful people and collaborations.

Most recently, I am working with some folks from CAIS (particularly Dom) on mitigating finetuning attacks on LLMs and reward hacking as a consequence of it. Being an Applied Scientist at VijilAI, I am working on building the largest database of red teaming prompts with Leif.

Before that, I worked with Subho, Dom, and Vipul on evaluating and improving the consistency of language models.

I was fortunate to collaborate via the MLC community with Yash and Laura on quantifying the robustness transfer from pretraining to downstream tasks from the lens of computer vision.

I did my bachelor thesis with Anil S. Parihar on Vision and Language Navigation (VLN) and fortunately, we secured a top-3 position in the most popular VLN challenge R2R.

I also spent my summer internship during my undergrad at Thoucentric as a researcher with Manu where I studied tabular data and built a novel deep learning framework.

News and Timeline

2024

2023

2022