ORF-RE11 Project Team Meeting Keynote
March 6, 2026
Key Statistics
Train with DNA, deploy with images only
Image courtesy of Gong et al. 2025

Cost Comparison
Image courtesy of Gong et al. 2025
Image courtesy of Lesperance et al. 2025
Note
Image courtesy of Lesperance et al. 2025
“How should I be safely integrating AI into my workflows?”
This is not about tomorrow.
This is about right now.
The pace of change is unprecedented.

Key Finding
“The length of tasks AI can do is doubling every 7 months” - METR Study
(Time Horizon 1.1 update suggests the rate may be accelerating)
Image courtesy of METR 2025, updated 2026
A fair criticism of the METR study is that it measures software tasks almost exclusively.
GDPval (OpenAI, Sep 2025) evaluates AI on real professional deliverables across 44 occupations and 9 industries — including manufacturing, healthcare, and finance.
Human experts blindly compare AI output against professional work.
Key Finding
AI output quality vs. human experts: 12% → 83% in under two years

“We might all find ourselves struggling to hold on to money, influence, even relevance. This new world could be more friendly and humane in many ways, while it lasts… But humans would be a drag on growth.”
— David Duvenaud, The Guardian, “Better at everything: how AI could make human beings irrelevant”
Organizations that don’t adapt to AI will be displaced by those that do
Avoiding AI accelerates displacement
Three pillars for thriving in the AI era:

Image: Aspuru-Guzik Group, UofT | Video: Global News / Sinton Lab
“I can feel my usage of Google search taking a nosedive already. I expect a bumpy ride as a new economic model for the Web lurches into view.” - Simon Willison




Quality UI/UX, easy export to other formats
Image courtesy of Elicit
“Catnip for programmers”
- Armin Ronacher

Videos courtesy of Simon Willison and Kushagrasikka
“Developers will be empowered to keep work queues full in large fleets of coding agents” - Steve Yegge
Video courtesy of All Hands AI
Graham Neubig from All Hands AI suggests these key skills:
| Imitation (SFT) | RLHF | Why the difference matters |
|---|---|---|
| Model copies full human output distribution. That includes occasional mistakes and mediocre phrasing. Ceiling ≈ human average. | Model samples its own answers and a human simply picks the better one. Humans don’t have to create perfection, only recognise it. | Humans are far stronger critics than creators. Preference-grading lets them steer the model away from the left tail (bad answers) and pull the whole distribution rightward. |
| Training signal = “produce exactly what a human would have written.” | Training signal = “move towards whichever candidate the human preferred.” | Over many iterations the reward model keeps nudging the policy towards the best-judged answers, eventually surpassing the median human. |
Image courtesy of Ayers et al. 2023
No “Taste 101” course - learn by doing
MYTH: Junior developers are doomed
REALITY: You’re best positioned to succeed
“Junior devs are vibing. They get it. The world is changing, and you have to adapt. So they adapt!”
“It’s not AI’s job to prove it’s better than you. It’s your job to get better using AI.” - Steve Yegge, “Revenge of the Junior Developer”
Study details:
Image courtesy of Si et al. 2025
Rule 1
“AI can be used to avoid learning, and AI can be used to assist learning”
Rule 2
“It’s ok to ask AI to do things you already know how to do,
but don’t ask AI to do things that you don’t know how to do”
Finding the right balance between learning with AI and automating with AI
Unprecedented change - AI capabilities doubling every 7 months
Avoiding AI accelerates displacement - Engage strategically instead
Develop your taste - The ability to discriminate quality remains crucial
Junior professionals have advantages - Adaptability + experience
Use AI to assist learning, not avoid it - Build skills while automating