About Me
I'm an ML Engineer and Data Scientist working on modular tools for unstructured text data pipelines - focused on asking consistent questions of document sets, then evaluating and displaying the results.
My background is in data science and Python. I earned a Public Policy PhD from RAND, worked as a data scientist for the Department of the Army, and until recently was part of the DHS AI Corps.
My areas of interest include:
- Methods for evaluating and classification of text data, including via LLMs
- Government data workforce and talent management issues
- Code-first data science and processes for better transparency and replication
- DC school data
Get in touch if you're interested in getting involved with/sponsoring Data Community DC or having me speak at your event.
Selected Talks
Selected Writing
Guide for non-technical stakeholders on how to evaluate AI solutions and what questions to ask.
Strategies for organizations to effectively adapt to the widespread use of LLMs in coding workflows.
Comprehensive approach to evaluating and testing LLM-based tools and chatbots.
Exploring strategies for handling non-deterministic AI outputs in production systems.
Techniques for rapidly assessing new LLM capabilities using other models as graders.
Approach to testing AI systems based on their specific threat models.
Strategies for effectively using GitHub to showcase your skills and experience.
Practical advice for early-career data scientists on optimizing their resumes.