Article
Pratik Bhavsar
pratikbhavsar
AI & ML interests
LLM agents, evaluation & reasoning
Recent Activity
liked
a Space
3 days ago
OpenEvals/evaluation-guidebook
updated
a Space
19 days ago
galileo-ai/agent-leaderboard
commented on
their
article
27 days ago
Agent Leaderboard: Evaluating AI Agents in Multi-Domain Scenarios