FinTech Australia
FinTech Australia
About
About Us
What is Fintech
Contact Us
Policy
Policy
Policy Working Groups
Events
Events Calendar
The Finnies
Intersekt Festival
Members
Corporate Partners
Fintech Careers
Jobs Board
eLearning
Resources
Ecosystem Map
Regulatory Map
Investor Map
EY Fintech Census
Services Directory
News
News
Podcast
Member Portal
FinTech Australia
FinTech Australia
About
About Us
What is Fintech
Contact Us
Policy
Policy
Policy Working Groups
Events
Events Calendar
The Finnies
Intersekt Festival
Members
Corporate Partners
Fintech Careers
Jobs Board
eLearning
Resources
Ecosystem Map
Regulatory Map
Investor Map
EY Fintech Census
Services Directory
News
News
Podcast
Member Portal
Folder: About
Folder: Policy
Folder: Events
Members
Corporate Partners
Folder: Fintech Careers
Folder: Resources
Folder: News
Member Portal
Back
About Us
What is Fintech
Contact Us
Back
Policy
Policy Working Groups
Back
Events Calendar
The Finnies
Intersekt Festival
Back
Jobs Board
eLearning
Back
Ecosystem Map
Regulatory Map
Investor Map
EY Fintech Census
Services Directory
Back
News
Podcast
hero

Companies you'll love to work for

0
companies
0
Jobs
For Employers
Add your job
listings
Contact Us
For Employers
Find Candidates
Directly
Talent Pool
For Candidates
Help Recruiters
Find You
Talent Network
Search 
jobs
Explore 
companies
Join talent network
Talent
My job alerts

Principal AI Evaluation Engineer

Backbase

Backbase

Software Engineering, Data Science
Posted on Dec 9, 2025
Apply now

As a a Principal AI Evaluation Engineeryou will be leading the evaluation efforts in our AI-powered SDLC team. You will own the evaluation strategy for AI assistants and agentic workflows, ensuring they are reliable, observable, and safeguarded with strong guardrails. Beyond hands-on work, you will mentor engineers, lead triage and reporting, and make evaluation a cornerstone of release decisions.

What you'll do

  • Define and lead the evaluation strategy and roadmap for AI-powered SDLC core product
  • Build and oversee evaluation pipelines and guardrails.
  • Build and maintain evaluation datasets (synthetic and real project data) to benchmark AI behavior.
  • Analyze evaluation results, identify gaps, and produce clear, actionable reports for engineering and product stakeholders.
  • Build a culture of innovation and excellence, encouraging continuous improvement and adoption of best practices in AI evaluation and deployment.
  • Collaborate with cross-functional teams to integrate evaluation insights into development.

Who you are

  • Strong understanding of software engineering principles and the software development lifecycle (SDLC).
  • Hands-on experience with test design, test management, observability, and data analysis.
  • Proficiency in Python (or another scripting language) for automating evaluations.
  • Familiarity with AI Agent evaluation methods (faithfulness, answer relevancy, contextual accuracy, tool correctness).
  • Excellent analytical and problem-solving skills.
  • Strong communication and collaboration abilities, able to work with cross-functional teams and stakeholders.
  • Demonstrated ability to mentor engineering talent, fostering collaboration and technical excellence.
  • (Nice to have) Experience with evaluation frameworks, RAG systems, or agentic workflows.
Apply now
Apply now
Apply now
See more open positions at Backbase
Privacy policyCookie policy
FINTECH AUSTRALIA

FinTech Australia exists to help our country become one of the world’s top markets for fintech innovation and investment.

IMPORTANT LINKS
  • Privacy Policy
  • Member Login
  • Join Fintech Australia
  • Contact Us
© 2023 FinTech Australia