Wednesday June 3
Registration
Doors open at 8pm and we'll have the barista coffee flowing and our exhibitors ready to hand out swag and answer all your questions. So get in early, get registered, and get ready for two incredible days.
Welcome
Take a seat and get ready for two intense days of world-class speakers.
How to keep up in AI Engineering
Shawn Wang coined the term "AI Engineer" with his seminal 2023 essay "The Rise of the AI Engineer."
Few people are better placed to help us keep pace with a field that seems…
Token Town (why compute strategy is product strategy)
Token pricing is a noisy headline. What matters in production is what you pay per task once outputs get longer, retries creep in, and “good enough” models get deprecated.
This talk breaks down the current dynamics…
Everything Is a Factory
Software development as we knew it is dead. The old world of hand-crafting every line, endless hiring cycles, and fragile vertical stacks has been overtaken by AI-driven software factories that turn simple prompts into…
Three Lanes Below One Millisecond: A Rust SDK for Gemini Live
Full-duplex voice agents punish the choices text-only agents forgive. A 50ms GC pause is a perceptible glitch. A blocked event loop is a barged-in user talking over your model. Google’s Python ADK is a beautiful kit for…
Keynote outro
We'll wrap up the first session ahead of a well-earned break.
Lunch
We've got great food, coffee, and hallway track sessions.
Make sure you visit our partners, to learn more, get great swag, and win great stuff!
AI Hamsters Engineering: Circling Your Way to Success
Every AI engineer knows the loop — prompt tweaks, evals, regressions, repeat. It feels like going in circles. But that's not the problem. The problem is not knowing if each circle is tighter than the last. This talk is…
Fail fast, fix faster: Why faster AI models beat smarter ones
The smartest model doesn't always win.
In agentic coding loops, a model that is 10x faster but only marginally competent can often fail its way to success before a frontier model finishes reasoning.
AJ Fisher…
Beyond Silicon Valley: Building AI Governance on the Fair Go Principle
Responsible AI isn’t culturally neutral. American AI development embeds distinctly American values — individual liberty, technological solutionism, and winner-takes-all competition. But what happens when these values…
Evaluation Precedes Evolution: Rubrics as the Load-Bearing Infrastructure of Self-Improving Agents
The 2025–2026 wave of "self-evolving" agents — prompt-tuning loops, memory accumulation, agent swarms, GEPA, ReasoningBank — share a structure that is sometimes lost in the jargon: every one of them is hill-climbing on…
Optimising GenAI at Runtime with Experimentation and Guardrails
Generative AI systems evolve constantly, and the impact of prompt or model changes often isn't clear until real users interact with them in production. In this session, learn how teams using Amazon Bedrock safely…
Beyond Forgetful Bots: Architectural Patterns for Persistent, Proactive Claw-Style AI Agents
Most AI agents are reactive chatbots—great for one-off queries, but they reset, forget, and lack initiative, failing in real-world use like personal assistants or autonomous workflows.
This talk dives into the…
Building Frameworks Building Systems
CI infrastructure is ripe for the vibing — so why don’t we? I have been.
Our company must deliver 200 interactive science and maths games for millions of students this year. Only a few years ago were we making 30, by…
Stop Blocking, Start Building: Rethinking Governance for the Agentic Era
As AI moves from "chat" to "act," the risk surface is exploding. Traditional governance is too slow, too onerous, and often looks in the wrong places. Join this session to learn how to:
- Identify the 3 fatal flaws…
Why AI coding tools might not make the slightest difference
Most “AI gives you 10x productivity” stories assume coding is the bottleneck. For large and mature companies this is almost never the case, so you roll out AI coding tools, people feel faster, but delivery metrics…
Shipping Sandboxed Workers for Notion Agents
In this talk, we will share how we built a platform at Notion that allows developers to extend AI agents with custom code. The system enables developers to write small programs that give their agents access to tools…
The State of the AI Engineering Job Market in Australia
Jake Maloney is the founder of AI Jobs Australia, a job board dedicated to Artificial Intelligence, Machine Learning, Data Science, and AI-adjacent emerging roles. That vantage point gives him insight into the…
Who Needs a LoRA?
Can you faithfully edit hand-drawn illustrations using an image model you haven’t fine-tuned? Most practitioners assume you need a LoRA for style-faithful editing. I set out to prove otherwise — building a production…
Constitutional Prompting: Making AI Coding Agents Reliable Without the Iteration Tax
Every engineering team trying to automate developer workflows with AI agents hits the same wall: the iteration tax. You ask an agent to review a PR, scaffold a feature, or audit code quality — it does something almost…
Having your cake and eating it: An implementation guide for privacy with AI
Everyone wants privacy, but the best models require you to give up control of your data. What options are there for keeping data private but while still embracing the promise of AI?
In this talk we'll take a…
Close your agentic loop
Every time you've told an agent it broke the layout of your website, output the wrong schema or failed an invariant - you are the feedback loop. The teams achieving the best outcomes right now are focused on building…
Don't Be Cheap: AI and the Appearance of Engineering
Software engineering has a pattern: demanding practices arrive and get reduced to their ceremonies. Agile kept the standups, lost the discipline. DevOps kept the postmortems, lost the learning. The form survives. The…
How Many Agents Are Too Many? The Hidden Cost of Multi-Agent Systems
Multi-agent systems promise scalability and smarter reasoning—but in production, more agents often mean more cost, latency, and failure. This talk shares real-world engineering lessons, metrics, and architectural…
From Zero to Production: How 15 Engineers Shipped a Production LLM Product with AI Coding Tools
How a team of fewer than 15 engineers at MYOB took an AI-powered chat experience from zero to production, embedded directly inside the product serving real small business owners and accountants. Leaning heavily into…
Panel: Governance & Ethics
A moderated conversation closing the Governance & Ethics session. Andrew Murphy leads a discussion with Aubrey Blanche, Hamish Songsmith, and Nick Lothian on how principles, operational frameworks, and hands-on privacy…
Kill the God Agent
Your multi-agent system probably has one orchestrator with access to every tool, every database, every API. If that agent gets injected, the entire toolchain is compromised. Guardrails won't save you. In this session,…
Multi-Armed Bandits: The Scientific Shotgun for Evals
A/B testing is too rigid a tool for AI systems. You're stuck serving worse results for the duration of the experiment and getting billed for slower models while three providers release SOTA updates this week.
Steal a…
Afternoon break
Grab a bite to eat, some coffee, check out the hallway track, and connect with our wonderful partners in this extended break ahead of today's final session.
Agent Observability: Monitoring and Understanding Agents at Internet Scale
Agent usage is exploding (if you haven't noticed) with an unprecedented transformation in the activities of both developers and other roles creating enormous volumes of new autonomous, dynamic decision making programs…
Engineering for the Agentic Web When 50% of Your Traffic is Robots
Over the last two years, our customer web traffic changed: today around 50% of visitors were unknown browsers and AI agents. The era of aligning with the traditional search engine crawlers with Core Web Vitals is…
Regulatory AI: Building Intelligent Compliance into Financial Operating Systems
Regulatory AI represents the next evolution of financial systems, where compliance, risk, and governance are no longer external constraints, but embedded, intelligent capabilities within the platform itself.
This…
Our AI Hallucinated in Production: How We Fixed It With Evals
We shipped one of REA Group’s first generative AI features to production: Property Highlights, which turns long real-estate listings into three skimmable takeaways. The demo was easy; real traffic wasn’t—hallucinations…
The AI Control Plane: When Your Infrastructure Becomes the Context Window
We've spent a decade codifying infrastructure, Terraform, Pulumi, CDK. This session explores what happens when you treat infrastructure as a queryable data layer: exposing cloud state, Skills giving agents reusable…
19Cabs: 1115 drivers, 500 customers, 90 days from idea — and why we still had to stop and rethink AI
Balram Singh is a frontend-focused full-stack engineer and architect with over a decade of experience building web and mobile applications. He currently works at Publicis Sapient in Australia, delivering large-scale…
Treating Infrastructure as Data: Building an AI-Native Control Plane
StackQL provides a unified control plane data model for agents, tools, processes and humans to interact with. The StackQL MCP server exposes this unified interface to AI agents, allowing them to query, provision, and…
The Agentic Contract: A practical framework from enterprises shipping agents to production
Most enterprise AI agent projects stall between POC and production because teams can't answer a basic question: is the agent actually performing as expected? This talk shares the practical framework we've developed with…
The Application Layer Is the New Research Lab
In the pre-genAI era, vertical product teams handed insights to a separate R&D group, who shipped a new model two quarters later. That handoff is now a bug. Agentic systems are built from dozens of model calls, judges,…
Democratizing Frontier LLMs - Cloud-cluster scale intelligence running on any Desktop PC
Explore the engineering breakthroughs behind running state-of-the-art, ultra-large Mixture-of-Experts models (100B-600B+ parameters) entirely offline on consumer-grade desktop hardware. This session dives deep into our…
Orbital Lasers vs For Loops: Economically Matching Models to Tasks
Most developers pick their AI model the same way: use the biggest, smartest one available for everything. Bash script? Opus. Dockerfile? Whatever's at the top of the dropdown. Then they hit their usage limits halfway…
Your Agent Doesn't Like Your APIs
Every API you've shipped was designed for a human reading docs. Agents don't read docs - they load your entire tool schema into a context window every call, then burn tokens guessing which endpoint to try.
Take a…
Enabling Safe AI Experimentation for Non-Technical Founders
Inga Pflaumer shares how a pre-seed startup without technical leadership was able to quickly build and iterate on their product using AI without sacrificing safety, focusing on setting up the right foundations and…
Your AI Can’t Engineer (Yet)
Large language models excel at code—but engineering isn't just code. When you ask an AI to calculate short-circuit currents per IEC 60909 or size a pavement per Austroads 2022, you're asking it to operate outside its…
Agentic Self-Healing in Production
Your pipeline breaks at 2am. Nobody's watching. By morning, it's already fixed.
That's not wishful thinking — that's agentic self-healing in production.
In this talk, we'll explore how AI agents can monitor, diagnose,…
Panel: Case Studies
A moderated conversation closing the L2 Case Studies session. AJ Fisher leads a discussion with Theo Adis, Balram Singh, and Inga Pflaumer about what it really took to ship AI into live businesses across regulated…
Flue: The Agent Harness Framework
Flue is a programmable, open source agent harness, able to represent any autonomous agent or workflow, from simple chatbots to entire coding platforms.
In this talk we'll will touch on…
How Canva built an Agentic Support Experience using Langfuse Observability
At Canva, our support experience is powered by multiple AI systems, from real-time assistance to asynchronous ticket resolution that handles complex, multi-step workflows and escalates to humans when needed. In this…
Reception
At the close of day one, join your fellow attendees and speakers at Zinc for a drink, conversation, and more, courtesy of our partners Stile.
Speaker dinner
Leadership ticket holders join speakers at our speaker dinner. Generously hosted by Cloudflare.
Registration
Doors open at 8pm and we'll have the barista coffee flowing and our exhibitors ready to hand out swag and answer all your questions. So get in early, get registered, and get ready for two incredible days.
Welcome
Take a seat and get ready for two intense days of world-class speakers.
How to keep up in AI Engineering
Shawn Wang coined the term "AI Engineer" with his seminal 2023 essay "The Rise of the AI Engineer."
Few people are better placed to help us keep pace with a field that seems…
Token Town (why compute strategy is product strategy)
Token pricing is a noisy headline. What matters in production is what you pay per task once outputs get longer, retries creep in, and “good enough” models get deprecated.
This talk breaks down the current dynamics…
Everything Is a Factory
Software development as we knew it is dead. The old world of hand-crafting every line, endless hiring cycles, and fragile vertical stacks has been overtaken by AI-driven software factories that turn simple prompts into…
Three Lanes Below One Millisecond: A Rust SDK for Gemini Live
Full-duplex voice agents punish the choices text-only agents forgive. A 50ms GC pause is a perceptible glitch. A blocked event loop is a barged-in user talking over your model. Google’s Python ADK is a beautiful kit for…
Keynote outro
We'll wrap up the first session ahead of a well-earned break.
Lunch
We've got great food, coffee, and hallway track sessions.
Make sure you visit our partners, to learn more, get great swag, and win great stuff!
AI Hamsters Engineering: Circling Your Way to Success
Every AI engineer knows the loop — prompt tweaks, evals, regressions, repeat. It feels like going in circles. But that's not the problem. The problem is not knowing if each circle is tighter than the last. This talk is…
Beyond Silicon Valley: Building AI Governance on the Fair Go Principle
Responsible AI isn’t culturally neutral. American AI development embeds distinctly American values — individual liberty, technological solutionism, and winner-takes-all competition. But what happens when these values…
Evaluation Precedes Evolution: Rubrics as the Load-Bearing Infrastructure of Self-Improving Agents
The 2025–2026 wave of "self-evolving" agents — prompt-tuning loops, memory accumulation, agent swarms, GEPA, ReasoningBank — share a structure that is sometimes lost in the jargon: every one of them is hill-climbing on…
Fail fast, fix faster: Why faster AI models beat smarter ones
The smartest model doesn't always win.
In agentic coding loops, a model that is 10x faster but only marginally competent can often fail its way to success before a frontier model finishes reasoning.
AJ Fisher…
Optimising GenAI at Runtime with Experimentation and Guardrails
Generative AI systems evolve constantly, and the impact of prompt or model changes often isn't clear until real users interact with them in production. In this session, learn how teams using Amazon Bedrock safely…
Beyond Forgetful Bots: Architectural Patterns for Persistent, Proactive Claw-Style AI Agents
Most AI agents are reactive chatbots—great for one-off queries, but they reset, forget, and lack initiative, failing in real-world use like personal assistants or autonomous workflows.
This talk dives into the…
Building Frameworks Building Systems
CI infrastructure is ripe for the vibing — so why don’t we? I have been.
Our company must deliver 200 interactive science and maths games for millions of students this year. Only a few years ago were we making 30, by…
Stop Blocking, Start Building: Rethinking Governance for the Agentic Era
As AI moves from "chat" to "act," the risk surface is exploding. Traditional governance is too slow, too onerous, and often looks in the wrong places. Join this session to learn how to:
- Identify the 3 fatal flaws…
Shipping Sandboxed Workers for Notion Agents
In this talk, we will share how we built a platform at Notion that allows developers to extend AI agents with custom code. The system enables developers to write small programs that give their agents access to tools…
Why AI coding tools might not make the slightest difference
Most “AI gives you 10x productivity” stories assume coding is the bottleneck. For large and mature companies this is almost never the case, so you roll out AI coding tools, people feel faster, but delivery metrics…
The State of the AI Engineering Job Market in Australia
Jake Maloney is the founder of AI Jobs Australia, a job board dedicated to Artificial Intelligence, Machine Learning, Data Science, and AI-adjacent emerging roles. That vantage point gives him insight into the…
Who Needs a LoRA?
Can you faithfully edit hand-drawn illustrations using an image model you haven’t fine-tuned? Most practitioners assume you need a LoRA for style-faithful editing. I set out to prove otherwise — building a production…
Having your cake and eating it: An implementation guide for privacy with AI
Everyone wants privacy, but the best models require you to give up control of your data. What options are there for keeping data private but while still embracing the promise of AI?
In this talk we'll take a…
Close your agentic loop
Every time you've told an agent it broke the layout of your website, output the wrong schema or failed an invariant - you are the feedback loop. The teams achieving the best outcomes right now are focused on building…
Constitutional Prompting: Making AI Coding Agents Reliable Without the Iteration Tax
Every engineering team trying to automate developer workflows with AI agents hits the same wall: the iteration tax. You ask an agent to review a PR, scaffold a feature, or audit code quality — it does something almost…
Don't Be Cheap: AI and the Appearance of Engineering
Software engineering has a pattern: demanding practices arrive and get reduced to their ceremonies. Agile kept the standups, lost the discipline. DevOps kept the postmortems, lost the learning. The form survives. The…
How Many Agents Are Too Many? The Hidden Cost of Multi-Agent Systems
Multi-agent systems promise scalability and smarter reasoning—but in production, more agents often mean more cost, latency, and failure. This talk shares real-world engineering lessons, metrics, and architectural…
From Zero to Production: How 15 Engineers Shipped a Production LLM Product with AI Coding Tools
How a team of fewer than 15 engineers at MYOB took an AI-powered chat experience from zero to production, embedded directly inside the product serving real small business owners and accountants. Leaning heavily into…
Panel: Governance & Ethics
A moderated conversation closing the Governance & Ethics session. Andrew Murphy leads a discussion with Aubrey Blanche, Hamish Songsmith, and Nick Lothian on how principles, operational frameworks, and hands-on privacy…
Kill the God Agent
Your multi-agent system probably has one orchestrator with access to every tool, every database, every API. If that agent gets injected, the entire toolchain is compromised. Guardrails won't save you. In this session,…
Multi-Armed Bandits: The Scientific Shotgun for Evals
A/B testing is too rigid a tool for AI systems. You're stuck serving worse results for the duration of the experiment and getting billed for slower models while three providers release SOTA updates this week.
Steal a…
Afternoon break
Grab a bite to eat, some coffee, check out the hallway track, and connect with our wonderful partners in this extended break ahead of today's final session.
Regulatory AI: Building Intelligent Compliance into Financial Operating Systems
Regulatory AI represents the next evolution of financial systems, where compliance, risk, and governance are no longer external constraints, but embedded, intelligent capabilities within the platform itself.
This…
Agent Observability: Monitoring and Understanding Agents at Internet Scale
Agent usage is exploding (if you haven't noticed) with an unprecedented transformation in the activities of both developers and other roles creating enormous volumes of new autonomous, dynamic decision making programs…
Engineering for the Agentic Web When 50% of Your Traffic is Robots
Over the last two years, our customer web traffic changed: today around 50% of visitors were unknown browsers and AI agents. The era of aligning with the traditional search engine crawlers with Core Web Vitals is…
Our AI Hallucinated in Production: How We Fixed It With Evals
We shipped one of REA Group’s first generative AI features to production: Property Highlights, which turns long real-estate listings into three skimmable takeaways. The demo was easy; real traffic wasn’t—hallucinations…
The AI Control Plane: When Your Infrastructure Becomes the Context Window
We've spent a decade codifying infrastructure, Terraform, Pulumi, CDK. This session explores what happens when you treat infrastructure as a queryable data layer: exposing cloud state, Skills giving agents reusable…
19Cabs: 1115 drivers, 500 customers, 90 days from idea — and why we still had to stop and rethink AI
Balram Singh is a frontend-focused full-stack engineer and architect with over a decade of experience building web and mobile applications. He currently works at Publicis Sapient in Australia, delivering large-scale…
The Application Layer Is the New Research Lab
In the pre-genAI era, vertical product teams handed insights to a separate R&D group, who shipped a new model two quarters later. That handoff is now a bug. Agentic systems are built from dozens of model calls, judges,…
Treating Infrastructure as Data: Building an AI-Native Control Plane
StackQL provides a unified control plane data model for agents, tools, processes and humans to interact with. The StackQL MCP server exposes this unified interface to AI agents, allowing them to query, provision, and…
The Agentic Contract: A practical framework from enterprises shipping agents to production
Most enterprise AI agent projects stall between POC and production because teams can't answer a basic question: is the agent actually performing as expected? This talk shares the practical framework we've developed with…
Democratizing Frontier LLMs - Cloud-cluster scale intelligence running on any Desktop PC
Explore the engineering breakthroughs behind running state-of-the-art, ultra-large Mixture-of-Experts models (100B-600B+ parameters) entirely offline on consumer-grade desktop hardware. This session dives deep into our…
Enabling Safe AI Experimentation for Non-Technical Founders
Inga Pflaumer shares how a pre-seed startup without technical leadership was able to quickly build and iterate on their product using AI without sacrificing safety, focusing on setting up the right foundations and…
Orbital Lasers vs For Loops: Economically Matching Models to Tasks
Most developers pick their AI model the same way: use the biggest, smartest one available for everything. Bash script? Opus. Dockerfile? Whatever's at the top of the dropdown. Then they hit their usage limits halfway…
Your Agent Doesn't Like Your APIs
Every API you've shipped was designed for a human reading docs. Agents don't read docs - they load your entire tool schema into a context window every call, then burn tokens guessing which endpoint to try.
Take a…
Your AI Can’t Engineer (Yet)
Large language models excel at code—but engineering isn't just code. When you ask an AI to calculate short-circuit currents per IEC 60909 or size a pavement per Austroads 2022, you're asking it to operate outside its…
Agentic Self-Healing in Production
Your pipeline breaks at 2am. Nobody's watching. By morning, it's already fixed.
That's not wishful thinking — that's agentic self-healing in production.
In this talk, we'll explore how AI agents can monitor, diagnose,…
Panel: Case Studies
A moderated conversation closing the L2 Case Studies session. AJ Fisher leads a discussion with Theo Adis, Balram Singh, and Inga Pflaumer about what it really took to ship AI into live businesses across regulated…
Flue: The Agent Harness Framework
Flue is a programmable, open source agent harness, able to represent any autonomous agent or workflow, from simple chatbots to entire coding platforms.
In this talk we'll will touch on…
How Canva built an Agentic Support Experience using Langfuse Observability
At Canva, our support experience is powered by multiple AI systems, from real-time assistance to asynchronous ticket resolution that handles complex, multi-step workflows and escalates to humans when needed. In this…
Reception
At the close of day one, join your fellow attendees and speakers at Zinc for a drink, conversation, and more, courtesy of our partners Stile.
Speaker dinner
Leadership ticket holders join speakers at our speaker dinner. Generously hosted by Cloudflare.
Thursday June 4
AI Leaders Roundtable Breakfast
Kick off day two of AI Engineer Melbourne with an exclusive leadership round table breakfast, sponsored by Vercel. Built for senior AI engineers and the leaders driving AI engineering inside their organisations, it's a…
Expo open, coffee available
Why not drop in before the day starts for a coffee from 8am? Our expo will be open.
Welcome — Day 2
Welcome back to day two of AI Engineer, another day full of amazing talks and conversation.
Craft in the Time of Agents
You feel more productive than you’ve ever been. You put on the Iron Man suit and now you’re building things in hours that used to take weeks. And you’re exhausted by Wednesday. The craft that used to sustain you — the…
State of the AI Model Landscape
An analysis of the leading AI models and the real trade-offs between them across intelligence, speed, price, token usage, and beyond, grounded in Artificial Analysis' independent benchmarking. Includes a forward-looking…
Morning keynote outro
Lunch
Is it brunch? Is it an early lunch? Whatever you call it, We'll have great food, sessions in the hallway track, coffee, and more.
Build Agent-Powered Workflows on Notion Developer Platform
Notion's Developer Platform gives developers and coding agents the building blocks to programmatically build on Notion. Connect to external systems, bring context into a shared workspace, and take permissioned actions…
Spec driven AI development - A Real World Perspective
AI demos are easy. Production systems are not.
In this session, we move beyond hype and explore what it actually takes to deliver AI systems that work in the real world. Not experiments. Not playgrounds. Proper,…
Not Everything Needs an LLM
I got frustrated. My support tickets kept getting routed to the wrong team. Every misroute added a day to resolution. I decided to fix it.
The obvious approach in 2024 was to throw the problem at an LLM. I knew that…
Observability and Evaluation for LLM Apps and Agentic AI with Langfuse
Shipping an LLM app is easy. Knowing whether it's actually working is hard. Unlike traditional software, LLM systems and agentic pipelines can run without errors while producing wrong or degraded answers — and with…
Deploying AI at the Edge: Model Compression and Hardware-Aware Optimization
Large AI models often struggle to meet the latency, memory, and power constraints required for real-world edge deployments. This talk explores practical techniques for making modern AI models efficient enough to run…
When a Small Language Model Beat Our LLM in Production
Large language models are often the default choice for production AI systems, even when the task does not require broad reasoning or generative depth. In this talk, I will share a real production case where an LLM-based…
Building SDKs in the Agentic Era
In the time it takes to train a frontier model, the open source libraries we rely on can undergo significant changes. This creates an ongoing delta between what an LLM coding agent suggests and what the best practices…
The AI Tax and "legal" ways to minimise it
AI tools feel productive. That's the problem.
A 2025 METR study found experienced developers were 19% slower using AI on their own codebases, yet believed they were 20% faster. I didn't need a study to tell me…
AGENTS.md is the wrong conversation
AGENTS.md started as a simple way to guide coding agents, but many teams are discovering that a default or poorly written one can actually make agents worse. Obvious facts, vague rules, outdated guidance, and generic…
Multi-Model Collaboration with Claude Code: How to Measure What Actually Works
We built Claudish, a free open-source proxy that lets Claude Code work with any AI model. 15+ providers directly - Google, OpenAI, xAI, Kimi, MiniMax, and more. OpenRouter for even wider access. Or fully offline with…
The Software Engineer Who Don't Code
AI coding tools are rewriting what it means to build software, turning engineers into architects, reviewers, and orchestrators of AI-generated code rather than authors of every line. This talk explores the emerging…
Your engineers aren't afraid of AI. They're afraid of becoming junior again.
When you roll out AI coding tools, you expect pushback about job security and workflow disruption. What you get instead is something harder to fix: senior engineers watching AI produce in seconds what used to take them…
Edge AI with Direct Device Control
Despite all the hype and promise, we are in the Timeshare Mainframe moment of AI. Even our devices rely on the cloud for most inference. As AI moves beyond the cloud and into the physical world, the real opportunity…
Engineering without reading code
In 2024 my team built 2 web-based Interactives for our Science Curriculum. In 2025 we built 50, in 2026 we expect to build over 100. In 2024 Engineers collaborated with Writers to build Interactives. In 2025 Writers…
How to Get Fired as an AI Engineer
The fastest way to become irrelevant is to keep doing the job exactly as it was defined last year. AI hasn't removed the engineer from the loop; it has changed which loop is worth owning. This talk is about the new…
COBOL and AI: Building a Self-Serve Knowledge Layer for 2,000 Batch Jobs
Modernization planning stalls when the business rules are locked inside decades of COBOL code. This talk shares a practical, production‑tested playbook I used to extract those rules, make them explainable, and serve…
The Death of Documentation
For decades, documentation has been the "sacred bridge" between human intent and machine execution. Historically, this was born of necessity: when computer time was scarce, we had to document our plans perfectly before…
The Red Flags of Vibe Coding a Dating app
Move fast. Follow the vibe. Launch it.
That works… until you’re building a dating app.
Based on real world experiments building AI driven dating and community tools, this talk breaks down what actually happens…
Panel: Engineering Reality
A moderated conversation closing the L3 Engineering Reality session. AJ Fisher leads a discussion with Dave Hall, Krishna Mundada, and Andy Kelk on when not to reach for an LLM, the real costs of AI in production, and…
Legacy Software + Agentic Discovery
Legacy Software powers the world - from banking to utilities and government. The hardest part isn’t the code - we have the code.... it’s when the old guy with the beard leaves, and the knowledge walks out with him: what…
Vibe-coded Multiplayer Video Games
Every had a great idea for a video game but developing it always felt out of reach? AI is at a point now, where you can develop fully-fledged, multiplayer online games without needing to code. I'll take you through my…
Designing Inference-Native Systems
For a long time, the world has run on systems built on logic. You put something in, follow a set of rules, and you get an output. Now we have systems that can run on inference: systems that can update belief, decide,…
Afternoon break
As the conference winds down, grab a final coffee, or a last bit of swag.
Stop Rebuilding Authorisation: Accelerating AI Agent Development
The demand for AI agents that take real action is immense. However, developers keep hitting a frustrating bottleneck: building the complex authorisation logic required to let agents act on behalf of users. Rebuilding…
Agentic SAST: Building an AI Pipeline for Rule Synthesis and Root-Cause Vulnerability Analysis
Project Electrification is an agentic, AI-powered application security pipeline designed to eliminate vulnerabilities at their source.
Autonomous agents scan large codebases, generate and execute custom SAST rules, and…
Why LLMs Fall for Stories (And 5 Production Patterns That Actually Stop Them)
Prompt injection isn't a bug - it's a feature. LLMs trained on humanity's written corpus learned something we didn't intend: narrative structure. They understand dramatic tension, plot twists, and persuasive framing.…
From AI Survey to Production: What the Readiness Gap Actually Looks Like
Everyone talks about AI transformation. Almost no one talks about what happens when you survey your workforce and discover that executives and employees have completely different ideas about what AI readiness means - or…
Hacking the Model: AI Red Teaming in Practice
AI is already in production—but almost no one has tested how it breaks. Today I’ll show you how attackers think, how models are actually exploited—from prompt injection to data exfiltration—and how to systematically…
AI After an Apocalypse
Cloud outages used to mean your site went down, maybe you couldn't deploy. Just small unimportant stuff. Now an outage means you can't even write any code. And unreliable connections cause the same problems as ever -…
What We Learned Taking a Culture-First Approach to AI Adoption at scale
Most AI transformation stories focus on tooling, targets, and adoption curves. At Culture Amp, our primary focus was people and culture. We still wanted to drive and accelerate our impact, but we weren’t willing to…
Stop vibing your agents to production: applying ML discipline to agent development
When I joined my current team, it was a familiar pattern: 6-8 experiments over a year, each taking 10-12 weeks, 60-70% of the time burned on infrastructure, one thing in production held together with duct tape, and our…
Why Most AI De-Identification Fails in Production, And How We Built One Lawyers Actually Trust
De-identifying text is easy to demo and surprisingly hard to ship. This talk is a deep technical case study of building SmartScrub, a reversible de-identification system designed for legal workflows, where privacy…
Fully Automated Luxury Gay Space Engineering
Autocomplete is so 2023. Chatbots were already tedious by 2024. Running agents locally was cool... back in early 2025.
The future of engineering doesn't have a human in the coding loop at all.
When it's within…
Beat Burnout, Find Flourishing: The AI Edition
AI tools are an amplifier. They don't just amplify productivity, they amplify whatever dynamics already exist in a team.
Steve Yegge calls it the Dracula Effect: AI coding at full speed is vampiric, draining…
Are Your AI Agents Secure? Defending the Privileged Agent
Are the AI agents you're developing truly secure?
AI agents that execute actions autonomously offer unprecedented value. But what about the "privileges" granted to them to act "on behalf of the user"?
Improper…
Your Agents Pass Every Benchmark—Then Memory Breaks Them in Production
You add memory to your agent, it works great in testing, and you ship it. A few weeks later, outputs start getting worse and nobody can figure out why. The agent is pulling in old information that's no longer true,…
12TB of AI coding agent logs - what works, what fails
Three things matter for AI coding effectiveness: the tool, the codebase, the developer. When we look at the nuance of sessions, we can see patterns across all these - what works, what doesn't, what you can control, what…
Panel: Culture & People
A moderated conversation closing the L4 Culture & People session — and closing the Leadership track. Andrew Murphy leads a discussion with Dr Christian Dandre, Eric Grigson and Paul Hughes (Culture Amp), and Navin…
AI Agents Are Distributed Systems
AI agents aren’t magic. They’re distributed systems — with better marketing.
Behind every impressive demo is a messy reality: multiple tools, remote services, auth boundaries, latency, retries, side effects, and…
Slop is a standards problem
Your feed is full of warnings about an incoming tidal wave of AI slop. Unmaintainable code. Crushing tech debt. Anyone with a prompt and ten minutes shipping production code. The fear is real, but it misses what's…
AI Leaders Roundtable Breakfast
Kick off day two of AI Engineer Melbourne with an exclusive leadership round table breakfast, sponsored by Vercel. Built for senior AI engineers and the leaders driving AI engineering inside their organisations, it's a…
Expo open, coffee available
Why not drop in before the day starts for a coffee from 8am? Our expo will be open.
Welcome — Day 2
Welcome back to day two of AI Engineer, another day full of amazing talks and conversation.
Craft in the Time of Agents
You feel more productive than you’ve ever been. You put on the Iron Man suit and now you’re building things in hours that used to take weeks. And you’re exhausted by Wednesday. The craft that used to sustain you — the…
State of the AI Model Landscape
An analysis of the leading AI models and the real trade-offs between them across intelligence, speed, price, token usage, and beyond, grounded in Artificial Analysis' independent benchmarking. Includes a forward-looking…
Morning keynote outro
Lunch
Is it brunch? Is it an early lunch? Whatever you call it, We'll have great food, sessions in the hallway track, coffee, and more.
Build Agent-Powered Workflows on Notion Developer Platform
Notion's Developer Platform gives developers and coding agents the building blocks to programmatically build on Notion. Connect to external systems, bring context into a shared workspace, and take permissioned actions…
Not Everything Needs an LLM
I got frustrated. My support tickets kept getting routed to the wrong team. Every misroute added a day to resolution. I decided to fix it.
The obvious approach in 2024 was to throw the problem at an LLM. I knew that…
Deploying AI at the Edge: Model Compression and Hardware-Aware Optimization
Large AI models often struggle to meet the latency, memory, and power constraints required for real-world edge deployments. This talk explores practical techniques for making modern AI models efficient enough to run…
Spec driven AI development - A Real World Perspective
AI demos are easy. Production systems are not.
In this session, we move beyond hype and explore what it actually takes to deliver AI systems that work in the real world. Not experiments. Not playgrounds. Proper,…
Observability and Evaluation for LLM Apps and Agentic AI with Langfuse
Shipping an LLM app is easy. Knowing whether it's actually working is hard. Unlike traditional software, LLM systems and agentic pipelines can run without errors while producing wrong or degraded answers — and with…
When a Small Language Model Beat Our LLM in Production
Large language models are often the default choice for production AI systems, even when the task does not require broad reasoning or generative depth. In this talk, I will share a real production case where an LLM-based…
Building SDKs in the Agentic Era
In the time it takes to train a frontier model, the open source libraries we rely on can undergo significant changes. This creates an ongoing delta between what an LLM coding agent suggests and what the best practices…
The AI Tax and "legal" ways to minimise it
AI tools feel productive. That's the problem.
A 2025 METR study found experienced developers were 19% slower using AI on their own codebases, yet believed they were 20% faster. I didn't need a study to tell me…
Multi-Model Collaboration with Claude Code: How to Measure What Actually Works
We built Claudish, a free open-source proxy that lets Claude Code work with any AI model. 15+ providers directly - Google, OpenAI, xAI, Kimi, MiniMax, and more. OpenRouter for even wider access. Or fully offline with…
AGENTS.md is the wrong conversation
AGENTS.md started as a simple way to guide coding agents, but many teams are discovering that a default or poorly written one can actually make agents worse. Obvious facts, vague rules, outdated guidance, and generic…
The Software Engineer Who Don't Code
AI coding tools are rewriting what it means to build software, turning engineers into architects, reviewers, and orchestrators of AI-generated code rather than authors of every line. This talk explores the emerging…
Your engineers aren't afraid of AI. They're afraid of becoming junior again.
When you roll out AI coding tools, you expect pushback about job security and workflow disruption. What you get instead is something harder to fix: senior engineers watching AI produce in seconds what used to take them…
Edge AI with Direct Device Control
Despite all the hype and promise, we are in the Timeshare Mainframe moment of AI. Even our devices rely on the cloud for most inference. As AI moves beyond the cloud and into the physical world, the real opportunity…
Engineering without reading code
In 2024 my team built 2 web-based Interactives for our Science Curriculum. In 2025 we built 50, in 2026 we expect to build over 100. In 2024 Engineers collaborated with Writers to build Interactives. In 2025 Writers…
How to Get Fired as an AI Engineer
The fastest way to become irrelevant is to keep doing the job exactly as it was defined last year. AI hasn't removed the engineer from the loop; it has changed which loop is worth owning. This talk is about the new…
COBOL and AI: Building a Self-Serve Knowledge Layer for 2,000 Batch Jobs
Modernization planning stalls when the business rules are locked inside decades of COBOL code. This talk shares a practical, production‑tested playbook I used to extract those rules, make them explainable, and serve…
The Death of Documentation
For decades, documentation has been the "sacred bridge" between human intent and machine execution. Historically, this was born of necessity: when computer time was scarce, we had to document our plans perfectly before…
The Red Flags of Vibe Coding a Dating app
Move fast. Follow the vibe. Launch it.
That works… until you’re building a dating app.
Based on real world experiments building AI driven dating and community tools, this talk breaks down what actually happens…
Panel: Engineering Reality
A moderated conversation closing the L3 Engineering Reality session. AJ Fisher leads a discussion with Dave Hall, Krishna Mundada, and Andy Kelk on when not to reach for an LLM, the real costs of AI in production, and…
Legacy Software + Agentic Discovery
Legacy Software powers the world - from banking to utilities and government. The hardest part isn’t the code - we have the code.... it’s when the old guy with the beard leaves, and the knowledge walks out with him: what…
Designing Inference-Native Systems
For a long time, the world has run on systems built on logic. You put something in, follow a set of rules, and you get an output. Now we have systems that can run on inference: systems that can update belief, decide,…
Vibe-coded Multiplayer Video Games
Every had a great idea for a video game but developing it always felt out of reach? AI is at a point now, where you can develop fully-fledged, multiplayer online games without needing to code. I'll take you through my…
Afternoon break
As the conference winds down, grab a final coffee, or a last bit of swag.
Stop Rebuilding Authorisation: Accelerating AI Agent Development
The demand for AI agents that take real action is immense. However, developers keep hitting a frustrating bottleneck: building the complex authorisation logic required to let agents act on behalf of users. Rebuilding…
From AI Survey to Production: What the Readiness Gap Actually Looks Like
Everyone talks about AI transformation. Almost no one talks about what happens when you survey your workforce and discover that executives and employees have completely different ideas about what AI readiness means - or…
Why LLMs Fall for Stories (And 5 Production Patterns That Actually Stop Them)
Prompt injection isn't a bug - it's a feature. LLMs trained on humanity's written corpus learned something we didn't intend: narrative structure. They understand dramatic tension, plot twists, and persuasive framing.…
Agentic SAST: Building an AI Pipeline for Rule Synthesis and Root-Cause Vulnerability Analysis
Project Electrification is an agentic, AI-powered application security pipeline designed to eliminate vulnerabilities at their source.
Autonomous agents scan large codebases, generate and execute custom SAST rules, and…
Hacking the Model: AI Red Teaming in Practice
AI is already in production—but almost no one has tested how it breaks. Today I’ll show you how attackers think, how models are actually exploited—from prompt injection to data exfiltration—and how to systematically…
AI After an Apocalypse
Cloud outages used to mean your site went down, maybe you couldn't deploy. Just small unimportant stuff. Now an outage means you can't even write any code. And unreliable connections cause the same problems as ever -…
What We Learned Taking a Culture-First Approach to AI Adoption at scale
Most AI transformation stories focus on tooling, targets, and adoption curves. At Culture Amp, our primary focus was people and culture. We still wanted to drive and accelerate our impact, but we weren’t willing to…
Why Most AI De-Identification Fails in Production, And How We Built One Lawyers Actually Trust
De-identifying text is easy to demo and surprisingly hard to ship. This talk is a deep technical case study of building SmartScrub, a reversible de-identification system designed for legal workflows, where privacy…
Stop vibing your agents to production: applying ML discipline to agent development
When I joined my current team, it was a familiar pattern: 6-8 experiments over a year, each taking 10-12 weeks, 60-70% of the time burned on infrastructure, one thing in production held together with duct tape, and our…
Beat Burnout, Find Flourishing: The AI Edition
AI tools are an amplifier. They don't just amplify productivity, they amplify whatever dynamics already exist in a team.
Steve Yegge calls it the Dracula Effect: AI coding at full speed is vampiric, draining…
Are Your AI Agents Secure? Defending the Privileged Agent
Are the AI agents you're developing truly secure?
AI agents that execute actions autonomously offer unprecedented value. But what about the "privileges" granted to them to act "on behalf of the user"?
Improper…
Fully Automated Luxury Gay Space Engineering
Autocomplete is so 2023. Chatbots were already tedious by 2024. Running agents locally was cool... back in early 2025.
The future of engineering doesn't have a human in the coding loop at all.
When it's within…
Your Agents Pass Every Benchmark—Then Memory Breaks Them in Production
You add memory to your agent, it works great in testing, and you ship it. A few weeks later, outputs start getting worse and nobody can figure out why. The agent is pulling in old information that's no longer true,…
12TB of AI coding agent logs - what works, what fails
Three things matter for AI coding effectiveness: the tool, the codebase, the developer. When we look at the nuance of sessions, we can see patterns across all these - what works, what doesn't, what you can control, what…
Panel: Culture & People
A moderated conversation closing the L4 Culture & People session — and closing the Leadership track. Andrew Murphy leads a discussion with Dr Christian Dandre, Eric Grigson and Paul Hughes (Culture Amp), and Navin…
AI Agents Are Distributed Systems
AI agents aren’t magic. They’re distributed systems — with better marketing.
Behind every impressive demo is a messy reality: multiple tools, remote services, auth boundaries, latency, retries, side effects, and…
Slop is a standards problem
Your feed is full of warnings about an incoming tidal wave of AI slop. Unmaintainable code. Crushing tech debt. Anyone with a prompt and ten minutes shipping production code. The fear is real, but it misses what's…
Agents 18
Coding Agents 25
Evaluation & Reliability 9
MLOps & Operations 6
AI Engineering Foundations 5
Software Engineering Practice 6
Leadership & Strategy 6
Other 12
Flagship Partners
Premium Partners
Exhibiting Partners
Attending Partners
Agency Partners
Launchpad Partners
Community Partners
About Us
Web Directions has for 2 decades years brought together leading developers, engineers, visual, IxD, UX and product designers, Art and Creative Directors, product managers indeed everyone involved in producing web and digital products to learn from one another, and the World's leading experts across this vast field.
We promise attending one of our events will leave you significantly better versed in the challenges you face day to day, and in solutions for addressing them.
John Allsopp
John Allsopp has worked on the web for nearly 30 years, creating innovative tools like Style Master and X-Ray. His ideas laid the groundwork for Typekit (now Adobe Fonts) and Responsive Web Design itself.
His seminal 2000 essay "A Dao of Web Design" was cited by Ethan Marcotte as a key inspiration for Responsive Web Design and by Jeremy Keith as "a manifesto for anyone working on the Web." He's authored books including Developing With Web Standards, spoken at conferences worldwide, and brings deep expertise and passion to Web Directions
Code of Conduct
For over a decade, we've worked hard to create inclusive, fun, inspring and safe events for the Web Industry.
As part of our commitment to these values, we've adopted a code of conduct for all involved: ourselves, our speakers, our partners and our audience.
If you have any concern or feedback, please don't hesitate to contact us.