Software Architect, Agent Evaluation & Core Framework Job at Datagrid AI, San Francisco, CA

  • Datagrid AI
  • San Francisco, CA

Job Description

Job Title: Software Architect, Agent Evaluation & Core Framework
Location: Remote first; SF Bay Area preferred

About Datagrid

Datagrid is the AI Agent that gets work done for you. Instead of just answering questions, Datagrid’s agents take action, automating entire workflows across your tools, files, and systems. Whether it’s searching through documents to find answers, cross-referencing data to uncover gaps, or running a financial analysis that updates your Excel file, Datagrid does the work so you don’t have to. You get your time back. You 10x your output. The AI runs the playbook.

Behind the scenes, Datagrid connects to over 100 platforms and 2,000+ APIs: Excel, Google Docs, SharePoint, Slack, PDFs, websites, and more. It handles multi-modal problems, from unstructured data such as images and documents to entire databases, and communicates through channels like Teams, Slack, or SMS. It’s built for trust and precision: agents cite their sources and operate safely in real time. Enterprise teams get full control with teamspaces, RBAC, and usage reports. You can customize everything: launch fast on your own, or partner with our expert team.

From research to reporting, from digging through files to delivering results, Datagrid doesn’t just assist. It executes. We’re looking for passionate individuals to join us at the frontier of AI innovation.

About the role:

Datagrid Agents operate where our customers work: across Teams, Slack, and even SMS. Agents make multistep plans, leverage vectorized data from 100+ sources, use tools like Docusign, and manipulate the Datagrid app. The Software Architect, Agent Evaluation & Core Framework, is crucial because we cannot manually test the vast array of agent interactions and capabilities.
You will own and drive the extension of our evaluation harness to provide actionable reports on agent regressions and improvements, directly impacting strategic direction and customer experience. A key part of this will be incorporating the best open-source benchmarks into our evaluation set and figuring out how to agentically generate evaluations that are representative of customer use cases. As you become established, you will also have the opportunity to make fundamental changes to the Core Framework to improve the way Agents reason, use tools, and collaborate with humans.

What you’ll do:

  • Work closely with an ex-Googler who built Gemini evals to create a harness for evaluating Agent performance, make that harness available both for local development and in CI/CD pipelines, and set up alerting for when Agents misbehave.
  • Influence and contribute to the extension of Datagrid’s Agentic capabilities.
  • Choose the best open- or closed-source components to build out the testing infrastructure.
  • Integrate publicly available benchmarks such as RAGBench into the testing system.
  • Grant subject matter experts the ability to add to the test library using customer queries, manually authored cases, and synthetically generated questions.
  • Expose evaluation performance via alerts and dashboards.

What you’ll have:

  • Proven track record of building test harnesses for Chat Agents from 0 ⇒ 1.
  • 10+ years of B2B software engineering experience.
  • Ability to write effective LLM prompts without assistance.
  • Proficiency with Node.js and server-side frameworks such as NestJS or Next.js.
  • Familiarity with JavaScript frameworks such as React and AngularJS.
  • Experience with databases such as Weaviate and BigQuery.
  • Experience working with GCP or similar cloud providers.

Nice to Haves

  • Experience with any LLM evaluation platform (Galileo, Arize, LangSmith, Orq)
  • Background in B2B SaaS automation tools
  • Contributions to open-source AI projects or published research
  • Familiarity with prompt engineering or model evaluation

Pay Range and Benefits

$200,000 – $240,000 USD per year, depending on experience and qualifications. At Datagrid we set pay ranges using market data, internal benchmarks, and the scope of responsibilities. Final compensation within this range will be determined based on relevant experience, skills, and geographic location.

In addition to base salary, this role may be eligible for:

  • Equity in the company
  • Home office set-up reimbursement
  • Health, dental, and vision benefits
  • Flexible PTO and remote work options

Equal Opportunity Employer

Datagrid is an equal opportunity employer and is committed to building a diverse and inclusive team. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law. We encourage candidates from all backgrounds to apply.

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Information Technology
  • Industries: Software Development

Job Tags

Full time, Local area, Home office, Flexible hours
