Project Manager at Brillio | Hybrid Hired

About the role

AI Agent Evaluation Engineer developing evaluation frameworks for AI agents with an emphasis on safety and ethical standards. Collaborating with AI teams to ensure high-quality performance metrics.

Responsibilities

Evaluation (Evals) Development: Develop synthetic testing environments and simulation strategies to stress-test agents under various real-world conditions.
Responsible AI and Safety Evals (New Focus): Develop and execute adversarial testing, jailbreaking, and red-teaming methodologies to identify potential harm, bias, toxicity, and unauthorized behavior in agent responses.
Test Strategy & Execution: Define comprehensive QA strategies, including functional, integration, regression, and user acceptance testing (UAT) specifically for conversational and goal-oriented AI agents.
Bug Detection & Management: Identify, document, prioritize, and track bugs using Jira, performance degradations, and alignment failures in agent behavior.
Automation & Tools: Integrate evaluation pipelines into the CI/CD process to enable continuous quality assurance and fast iteration cycles.

Requirements

Experience: 6+ years in Software QA, with at least 2 years focused on testing or evaluating AI/ML systems, conversational agents, or Large Language Models (LLMs).
Safety Evals Expertise (Mandatory): Direct experience in designing and executing safety evaluations (red teaming, adversarial testing), bias detection, and measuring toxicity/harmful content in generative AI models.
Agent/LLM Evals: Proven experience developing and running general evaluations (Evals) for LLM-powered applications knowing libraries like PyTest (Must)
Google ADK Familiarity (Mandatory): Direct or strong conceptual understanding of the Google Agent Development Kit (ADK) and its components.
Programming: Strong proficiency in Python is mandatory for script development, data processing, and automation.
Cloud & MLOps: Familiarity with Google Cloud Platform (GCP) services relevant to AI/ML (e.g., Vertex AI) and integrating testing into MLOps workflows.
Tools and Libraries: Langsmith, DeepEval, Ragas, Giskard, Hugging face.

Similar roles

Browse all Project Manager jobs

14 minutes ago

LS

Senior Civil Project Manager

Langan Engineering & Environmental Services

Senior Civil Project Manager at Langan, leading design, permitting, and client management for diverse land development projects. Collaborating with industry leaders in a supportive work environment while ensuring project success.

Onsite Role

Arlington United States Project Manager

yesterday

JO

Project Manager

Jobs2web

Project Manager providing project management leadership for Teradyne's vital projects in Italy's Solution Engineering Group. Focus on cross - functional integration and communication with key stakeholders.

Onsite Role

Milan Italy Project Manager

yesterday

MH

Manager, Project Management

Mission Technologies, a division of HII

Technical Project Manager leading and supporting IT projects from initiation to completion at HII's Mission Technologies. Collaborating with engineering and customer teams to ensure project success.

Onsite Role

Wright-Patterson Air Force Bas United States Project Manager

$99,408 - $140,000 per year

2 days ago

AP

Project Manager

Arrowhead Programs

Project Manager leading planning, execution, and delivery of insurance projects ensuring alignment with business objectives and stakeholder expectations.

Hybrid Role

United States Project Manager

$95,000 - $110,000 per year

2 days ago

NP

Project Coordinator

Nonviolent Peaceforce

Project Coordinator managing Nonviolent Peaceforce’s Programme to reduce violence and enhance security in Ninewa, Iraq. Overseeing the implementation of Unarmed Civilian Protection for social cohesion and safety.

Onsite Role

Erbil Iraq Project Manager

2 days ago

FI

Project Manager – Social & Influencer Operations

Findasense

Project Manager managing social media and influencer projects at a global customer experience company. Driving operational excellence and ensuring project delivery across markets.

Hybrid Role

Bogotá Colombia Project Manager

2 days ago

OR

Project Manager

Origin

Project Manager managing IT and Business Applications for Origina, a growing international company. Supporting project management practices and delivering internal changes in a hybrid environment.

Hybrid Role

Sandyford Ireland Project Manager

2 days ago

JR

Restoration Project Manager

Jenkins Restorations

Sales professional handling emergency service calls for restoration needs. Responding to urgent situations and converting leads into signed jobs in a high - pressure environment.

Hybrid Role

Crossville United States Project Manager

$90,000 - $100,000 per year

2 days ago

IE

Channel Lead – Project Analyst Specialist

Invensys - Acquired by Schneider Electric

Channel Lead Project Analyst Specialist at Schneider Electric, analyzing data and performing audits for business efficiency. Collaborating with teams to meet corporate policies and regulatory requirements

Hybrid Role

Ottawa Canada Project Manager

2 days ago

WP

Project Manager, Platform Business Development

Woven Planet

Project Leader driving business strategies for Robot Platform at Woven by Toyota. Focusing on customer - centric solutions, market research, and partnership development.

Hybrid Role

Tokyo Japan Project Manager