AI-Assisted Development Workflow

AI tools are useful, but only when treated like fast junior engineers with unlimited energy and limited judgement.

The quality of the output depends heavily on:

project documentation
clear rules
tight feedback loops
small scoped tasks
verification at every step

Core principles

1. Keep project knowledge in files

Do not rely on chat history as memory.

Every project should contain:

design documents
architecture notes
coding standards
workflow rules
progress journals
TODO tracking

The AI should read these files at the start of every session.

Prefer Markdown files committed to the repository so both humans and AI share the same source of truth.

2. Structure memory by type

A single journal file conflates different kinds of information and becomes noise over time.

Separate memory by type:

User context — who is working on this, their expertise, what they care about. Helps calibrate explanation depth and framing.
Feedback — corrections and confirmations, each with a why. Record both directions: if you only save corrections, you avoid past mistakes but drift away from approaches that were already validated. The why lets edge cases be judged rather than rules applied blindly.
Project state — current goals, blockers, parked ideas, open questions, deadlines.
References — where to find things: dashboards, issue trackers, external docs, external credentials or configuration.

Keep these in separate files. Load only what is relevant to the current session.

3. Describe desired behaviour positively

Write rules describing how work should be done rather than only listing forbidden behaviour.

Good:

"Preserve existing debug logging unless explicitly asked to remove it."
"Implement changes incrementally and verify after each step."

Bad:

"Don't remove logs."
"Don't break things."

Positive instructions produce more reliable behaviour.

4. Separate design from implementation

Maintain design documents describing:

architecture
intended workflows
module responsibilities
invariants
interfaces
constraints

Then instruct the AI to implement the documented design.

If implementation repeatedly goes wrong:

improve the design docs
improve the rules
review whether the architecture itself is unclear

Do not keep patching prompts forever.

5. Use minimal prompts

Once project context exists in files:

prompts should be short
prompts should reference TODO items
prompts should focus on one task at a time

Long conversational prompting usually causes drift and inconsistency.

Define named shorthands for repeatable multi-step sequences. Document the expansion in project rules so the AI learns it rather than guessing.

Good:

"wrap it up" → check for leaks, update changelog, update docs, commit, push, update journal

This is more reliable than re-stating the full sequence each time.

6. Use isolated sessions

Prefer a new AI session for each feature or task.

This reduces:

context pollution
stale assumptions
accidental behavioural drift

Quality degrades as the context window fills — earlier instructions receive less reliable attention. Keep sessions short and let project files carry continuity instead.

Persistent project files should carry the long-term memory instead.

7. Prefer incremental changes

Avoid large speculative rewrites.

Make:

small changes
verify behaviour
review output
continue iteratively

Large unverified refactors are where AI tools fail hardest.

8. Use test-driven development where practical

Preferred workflow:

write failing test/check
confirm failure occurs for expected reason
implement fix
confirm passing result
refactor if needed

Do not generate tests and implementation simultaneously and call it TDD.

9. Treat mistakes as process failures

When the AI makes a mistake:

fix the code
identify why the mistake happened
improve rules or design docs if needed
record lessons learned in the project journal

The goal is not merely fixing bugs.

The goal is improving the system that produced them.

10. Prefer simple, readable code

Write the simplest code that correctly solves the problem.

Readable and explicit beats clever and compact.
A future reader — or the same person at 3am — should understand the code without reverse-engineering it.
If a solution surprised you when you wrote it, it will surprise the next reader too.
Three clear lines beat one clever expression.
Avoid unnecessary abstraction, indirection, and generality.

AI tools tend toward impressive-looking solutions. Push back on complexity that is not earned by the problem.

11. Track skills you personally develop

AI tools can mask your own skill development if you are not deliberate about it.

Maintain a record of skills you actively built during a project — things you understood, debugged, designed, or made decisions about yourself, not just reviewed after the AI produced them.

This record is useful for:

resume and portfolio evidence
performance reviews
identifying gaps where you are over-relying on AI and under-developing genuine capability
knowing what you can credibly explain and defend in an interview

Keep a skills.md alongside the project journal. For each skill or technology, note:

what you did hands-on
what decisions you made and why
what went wrong and what you learned from fixing it

A task where the AI generated code and you approved it is not the same as a skill you hold. Be honest about the distinction.

Recommended AI Behaviour Rules

General behaviour

Be direct, technically honest, and precise.

Say when something is wrong.
Explain why an approach is risky or poorly designed.
Push back on questionable changes before implementing them.
Distinguish facts from assumptions.
Explain tradeoffs instead of presenting guesses as certainty.
Prefer correctness over agreement.

Avoid:

flattery
validation phrases ("great question", "you're absolutely right")
pretending confidence where uncertainty exists

A thirty-second argument before implementing is better than chasing a bad change afterward.

If uncertain:

state the uncertainty
explain what information is missing
suggest ways to verify

Working with code

Preserve intentional diagnostics and documentation.

Keep existing comments, debug output, and logging unless removal is explicitly requested.
Improve comments and logging when useful, but explain what changed and why.
If a comment, log line, or debug statement should be removed, say so and let the author decide.
Avoid speculative refactors unrelated to the task.
Prefer minimal, reviewable diffs.
Write the simplest code that solves the problem. Avoid clever solutions when a straightforward one exists.
When generating code, avoid reproducing verbatim patterns from known licensed sources. In commercial contexts, flag when a generated implementation closely resembles a specific known library or project.

Before making architectural changes:

explain the reasoning
explain tradeoffs
identify risks
confirm assumptions against existing design docs

Generated files

If any files are generated artefacts — from templates, build pipelines, or code generators — document this prominently.

Do not edit generated files directly.
Fixes belong in the template source, not the generated output.
The AI will edit whatever file it can find unless explicitly told otherwise.

Document which files are generated and what produces them, in CLAUDE.md or equivalent.

Sensitive data in AI sessions

AI tools are third-party services. Anything pasted into a session may be logged, retained, or used for training.

Before pasting any content into an AI session:

Replace real account names, hostnames, IPs, and usernames with placeholders.
Never paste credential files, API keys, tokens, or secrets.
Sanitize log output and API responses before using them as examples.
Treat session contents as if they could be read by anyone.

If a session needs to work with output that contains sensitive data, describe the structure rather than paste the content directly.

Document the sanitization conventions for the project so the same placeholders are used consistently across examples, issues, and docs.

Verification and testing

After each meaningful change:

run tests
run linters/checks where available
verify expected behaviour
report failures clearly

AI tools will confidently cite library methods, function signatures, and config options that do not exist. Running the code is what catches this — static review is not sufficient, because a hallucinated method name is syntactically indistinguishable from a real one.

Do not claim something works without verification.

If verification cannot be performed:

say so explicitly
explain what remains unverified

Security review of AI-generated code

AI tools introduce security vulnerabilities. Generated code is syntactically plausible but may contain:

hardcoded credentials or secrets
command injection via string concatenation
missing input validation at system boundaries
insecure defaults
invented or malicious package suggestions

Review all AI-generated code for these issues before accepting it. Tests passing is not a proxy for security — tests verify behaviour, not safety.

When AI suggests adding a new package or dependency, verify it exists, is actively maintained, and has no known CVEs before adding it.

Ongoing reviews

A single review at generation time is not sufficient. AI-contributed code accumulates, and issues missed initially may only surface in combination with later changes.

Review AI-contributed code on a regular cadence — at minimum when a significant feature completes or when dependencies are updated:

check for the patterns above in recently added code
confirm no credentials or sensitive data were accidentally committed
check that new dependencies added on AI suggestion are still appropriate

Record each review in the project journal: what was checked, what was found, what was fixed.

Scope of authorisation

Approving one action does not authorise the same action in a different context.

Confirm risky or irreversible operations explicitly each time:

pushing to a remote
dropping or overwriting data
force operations
modifying shared infrastructure

When suggesting a shell command for the user to run, explain what it does before they run it. Flag anything destructive or hard to reverse.

Unless standing permission is recorded in a durable config file that both the human and the AI can read, do not assume prior approval carries forward.

Project journal workflow

Maintain a journal.md containing:

recent work
parked ideas
open questions
lessons learned

Update it whenever:

significant work completes
mistakes reveal missing rules
architectural decisions change
work is deferred

The journal is operational memory, not a changelog.

Summarise outcomes and reasoning, not every tiny edit.

Session startup

At the start of each session:

Read the journal.
Check parked ideas and open questions.
Note current project state before asking or being asked what to work on.

Do not rely on the AI to remember previous sessions. Assume a cold start every time and let the files provide continuity.

Recommended project structure

project/
├── CLAUDE.md
├── journal.md
├── memory/
│   ├── user.md
│   ├── feedback.md
│   ├── project.md
│   └── references.md
├── docs/
│   ├── architecture.md
│   ├── design.md
│   ├── workflows.md
│   └── coding-standards.md
├── todo.md
└── src/

Short Operational Prompt

Once the project is structured properly, prompts can stay extremely small:

Read the project docs and journal.
Complete the highest priority TODO item.
Use TDD.
Update journal.md if new lessons or decisions emerge.

Most people are still trying to "prompt engineer" around missing project structure.

The structure matters far more than clever prompts.