Product Designer · Work Sample 2026

Close the books before month-end — not after it.

A push-first transaction review loop that meets the SMB owner in the moment a charge happens, lets AI handle the obvious, and keeps the accountant out of email jail.

<5%
Uncategorized $ at EOM (target)
~30s
To resolve a flagged charge
Clients one accountant can hold
0
Emails between Casey & Avery
Role
Sole product designer (research → IxD → visual)
Timeline
4-hour design exercise · May 2026
Surface
iOS push + mobile review + Casey web dashboard
Constraint
Real partner APIs (Ramp, Amex, PayPal) where possible
01 · The problem

Month-end is the wrong moment to ask Avery anything.

By the time Casey opens her review queue, the receipts have scattered, Avery is on location, and the context behind a $524 Amazon charge has evaporated. The conversation that follows — email, text, voicemail — is a tax on both of them.

Today — friction-heavy
Pile up · ask later · miss EOM
  1. 1
    Casey

    Transactions accumulate quietly all month

  2. 2
    Casey

    EOM hits — Casey reviews a pile she can't categorize alonepile-up

  3. 3
    Casey

    Emails + calls Avery for every ambiguous chargecontext-loss

  4. 4
    Avery

    Sees the email two weeks later, on set, can't recall the purchaseno-memory

  5. 5
    Avery

    Half-answers, drops thread, never uploads the receiptno-response

  6. 6
    Casey

    EOM slips. Both sides frustrated. Repeat next month.EOM-miss

Leafly — real-time push
Catch the charge while the memory is hot
  1. 1
    Leafly API

    Amex / Ramp / PayPal / Amazon webhook fires within seconds

  2. 2
    AI

    ≥95% confidence → auto-categorize. <80% → flag for Avery.

  3. 3
    Avery

    Lock-screen push the same hour: 'Amazon $524 — production gear?'30s

  4. 4
    Avery

    One tap to confirm. Or delegate to Maya. Or attach a photo.

  5. 5
    System

    Quiet escalation if 3 days pass — SMS, then email card, then Casey.

  6. 6
    Casey

    EOM arrives at 98% done. Casey reviews exceptions, not the pile.

02 · Who I'm designing for

Two people, opposite contexts, one shared deadline.

C
Casey · The accountant
Books for 8 SMB clients. Drowning in unfamiliar charges.
If I have to chase Avery one more time, I'm dropping Lion Snacks.
  • See what's actually blocking close, not every transaction
  • Stop being the human reminder system
  • Carry more clients without working weekends
A
Avery · The SMB owner
Runs Lion Snacks, a 6-person film production company. On location 4 days a week.
I'll answer when I have a second. I always mean to.
  • Be asked in the moment, not three weeks later
  • 30 seconds, lock-screen, no app gymnastics
  • Hand stuff off to Maya when she ran point on the shoot
03 · Strategy

Four layers that compound — not one silver bullet.

The current loop fails in four different places, so the fix is four connected moves. Each one stands alone, but the leverage comes from running them as a system.

Layer 1

API auto-pull · eliminate AI guessing

Where Leafly has a partner connection (Ramp, Amex Business, Amazon Business, PayPal webhook), pull line-items, not totals. Gaffer tape $48, Sandbags $138 — fact-based categorization, not pattern matching.

Impact~60% of ambiguity disappears before AI is involved
Layer 2

Push the SMB owner — not the accountant

Ambiguous charges fire a lock-screen push to Avery within minutes, with the top AI-suggested category pre-filled. The reply path is one tap, while the memory is still hot.

Impact~90% drop in Casey → Avery messaging
Layer 3

Escalate gracefully — never one-channel-fail

No reply in 24h → SMS magic link. 48h → email action card (Stripe-style, reply inline). EOM−5d → Casey gets a single 'needs your attention' tile. Push fatigue is real, so unrelated charges batch into a daily digest.

ImpactChannel waterfall, not channel spam
Layer 4

Casey ops at portfolio scale

Casey's dashboard shows clients by % categorized, not by alphabetical order. Exception queue surfaces only what AI + Avery couldn't resolve. The week-of-EOM view is a single screen.

ImpactOne accountant comfortably holds 10+ clients
The escalation waterfall

If the push misses, the system tries again — without Casey.

Step 1T+0
Push

Lock-screen with AI suggestion. Tap to confirm.

Step 2+24h
SMS · magic link

Browser-based reply for Avery's teammates without the app.

Step 3+48h
Email · action card

Reply buttons inside the email body. No app jump.

Step 4EOM−5d
Casey · last resort

A single tile on Casey's dashboard, only for the unresolved.

AI confidence tiers

What gets auto-handled vs. what gets a human.

95–100%
Repeats with strong history — auto-categorize, no human
Squarespace · Delta · Adobe
Auto
80–94%
Strong AI guess — Casey one-click approves in her queue
First-time merchant, recurring amount
Casey
<80%
Push Avery — memory is the only source of truth
PayPal to a person, new payee, large amounts
Avery
03.5 · The architecture underneath

From categorization software to operational memory.

The four-layer strategy is the shape of the product. Underneath it is an LLM-native architecture that changes the question accounting software is even asking — from "how do we categorize transactions?" to "how do we preserve financial context before it disappears?"

Current SaaS accountinglinear · lossy
  1. 1

    Transaction

  2. 2

    Rule / OCR

  3. 3

    Categorization

  4. 4

    Human review

  5. 5

    Month-end cleanup

Context is gathered weeks after the fact, through email.
AI-native accounting agentlooped · compounding
  1. 1

    Transaction

  2. 2

    Context ingestion (bank + email + receipt + history + org memory)

  3. 3

    LLM reasoning layer

  4. 4

    Conversational clarification

  5. 5

    Adaptive escalation (push · SMS · delegate)

  6. 6

    Continuous memory + audit trail

Context is captured the moment it exists, then compounded.
Six layers of the agent

Each layer earns the next one.

Layer 1

Context ingestion engine

Bank feeds, business cards, PayPal metadata, email receipts, OCR uploads — fused with vendor history, prior approvals, location & timing, recurring project patterns.

AI reasons using context, not just merchant names.
Layer 2

LLM reasoning engine

Not 'Uber → Travel'. Instead: 'this resembles prior production transportation approved by Avery during late-night studio shoots.' Confidence-scored, evidence-linked.

Categorization with a reason, not just a label.
Layer 3

Conversational accounting UX

Avery can ask: 'Why is this flagged?' · 'Split this expense.' · 'Find the receipt.' · 'Who handled this last time?' Human language in, accounting structure out.

The interface is a conversation, not a form.
Layer 4

Adaptive communication agent

Tone shifts with stakes — urgent at EOM, casual mid-month, delegate-aware when Maya is the right asker. Replaces reminders with judgment.

Right channel, right tone, right person.
Layer 5

Organizational memory graph

Maya → props. Jordan → lighting. Recurring Airbnb → crew lodging. Late-night Uber near studio → production transport. Each client converges on its own model.

Leafly learns how this team spends.
Layer 6

Continuous learning loop

Every confirmation, correction and delegation reweights the model. Manual review drops month over month as confidence compounds.

M1 ~68% manual → M12 ~7% manual.
Explainable AI · audit trail

Every auto-categorization ships with its reasoning.

Casey doesn't have to trust a black box — she can read why. That's what unlocks accountant confidence, compliance, and the right to reduce human review.

  • Accountant trust
  • Auditability
  • Compliance transparency
  • Explainable automation
Categorized · Amazon $524
Production Supplies · 93% confidence
Reasoning
matched 7 prior production purchases
amount within expected range
approved previously by Avery
Gmail receipt confirms equipment items
vendor historically categorized as production
Positioning shift
Before

Accounting automation tool.

After

AI-powered operational memory system for SMB finance teams.

04 · Key screens

Four moments that make the loop feel inevitable.

The full mobile flow is interactive here. Below: the three moments that decide whether this whole strategy works, plus Casey's portfolio view.

9:41
Tuesday, May 14
9:41
1 · The push lands while Avery is still on set — one tap to confirm.
9:41
2 of 4
Needs your eyes · 2 min ago
$524.00
Amazon · Visa ••4821
Tue, May 14 · 11:42 AM · Brooklyn, NY
Auto-parsed from receipt
Gaffer tape (3)$48.00
Sandbags 25lb (4)$138.00
C-stand 40"$289.00
Shipping$49.00
Leafly suggests92% confidence
2 · Tap-through opens the review screen, with line-items already pulled and the AI suggestion pre-selected.
9:41
Ask the team

Who can confirm this?

Maya was on the May 12 shoot. Likely she ordered this.

M
Maya Castillo
Producer · on May 12 shoot
Strong match
J
Jordan Liu
AD · on May 12 shoot
Possible
R
Reese Park
DP
Maya gets an SMS with a one-tap link. She doesn't need the Leafly app — the response opens in her browser.
3 · Don't know? Delegate to Maya — she gets her own magic-link SMS, no app install required.
4 · Casey's portfolio view
leafly.app/close
May · 11 business days to close

Close Dashboard

On-track to close
9/ 12
+4 vs last month
Auto-handled · May
237/ 248
96% — +26 pts since rollout
Owner-reply backlog
94pending
2 owners flagged
Avg time-to-confirm
42sec
−4m 18s vs email loop
Categorization rate
Last 9 months · portfolio avg
62% → 96%
Sep
Oct
Nov
Dec
Jan
Feb
Mar
Apr
96%
May
Rollout shipped Feb — the curve sells itself.
Auto-handled · daily volume
Charges resolved without human touch · 14 days
+74% MoM
2 wks agotoday251121
AI auto Owner replypeak 251 · today 248
12 closes · this month
Sorted by risk
ClientProgressAutoStatusOpen
Lion Snacks Inc.Just closed
100%
96%
Closed
Hawk Bicycles
94%
91%
On track
3
Mira Coffee Co.
100%
97%
Closed
Bridge Capital
89%
86%
On track
7
Field & Form
97%
93%
On track
2
Marigold Studio
41%
38%
At risk
22
Pico Roasters
28%
24%
Blocked
31
Hollow Mtn Co.
100%
95%
Closed
Bayside Yoga
78%
81%
On track
6
Trellis Floral
100%
94%
Closed
Indigo Press
65%
72%
On track
9
Quartz Brewing
52%
49%
At risk
14
Recent activity
Live
  • Avery · Lion Snacks
    Confirmed Amazon $524 as Props & Set
    2 min ago
  • Maya · Lion Snacks
    Uploaded receipt for PayPal $1,850 (N. Reyes)
    8 min ago
  • Avery · Lion Snacks
    Delegated PayPal $1,850 → Maya via SMS
    14 min ago
  • Leafly AI
    Auto-categorized 18 charges for Hollow Mtn Co.
    1h ago
  • Casey · You
    Approved Bridge Capital monthly close
    2h ago
  • Leafly AI
    Escalated Pico Roasters · 3 owner-replies stalled
    3h ago
0 emails sent this month — every reply happened in-app.

Clients sort by close-completeness, not alphabet. The progress bar decomposes the month so Casey can see at a glance how much Leafly handled, how much Avery cleared, and what's still waiting.

Open full dashboard →
05 · What I'd test & build next

The honest gaps in this exercise — and how I'd close them.

Validate the API assumption

Amazon doesn't have a public consumer Order API. Phase 1 would lean on Gmail/Outlook receipt parsing; Amazon Business is the real long-game integration.

Push batching, not push spam

Active production weeks could generate 10–20 charges/day. Anything sub-$500 from a known merchant batches into a daily digest. Only new payees or large amounts interrupt.

Delegation has its own loop

Right now a delegated charge could become a second black hole. Maya gets an accept/decline; if she's silent for 24h, the ball returns to Avery automatically.

Onboarding is a separate design

Connecting Ramp, inviting Maya, choosing what auto-categorizes — there's a whole first-run flow I'd want a sprint for, not a screen.

Measure the right thing

Target should be 'uncategorized $ at EOM < 5% of total spend' — not count of transactions. Need a baseline measurement before we can claim a delta.

The AI feedback loop

When Avery corrects a suggestion, that signal should reweight the merchant + amount + time-of-day pattern for this specific client. Each client converges on its own model.

Try it

The fastest way to understand this design is to tap through it.

Six screens, two minutes — Avery gets a push and resolves a charge without ever opening her email.

Walk through the prototype