docs: draft agent-oriented linting paper by danielchen0 · Pull Request #67 · Create-Inc/laint

danielchen0 · 2026-05-17T01:00:36Z

Summary

Drafts an arXiv-style paper for laint around agent-oriented linting for generated JSX/TSX applications. The current draft frames laint as both an expert-curated benchmark and a feedback-loop tool for surfacing framework-specific generated-app failures before slower build, preview, device, or runtime checks.

The PR now includes checked-in raw prompt-grid artifacts, generated result tables, and a repair-loop pilot. The repair results are framed as diagnostic-feedback compliance signals: 476 -> 101 reported findings, 375 net reduction, 445 rule-level findings resolved, and 70 introduced findings across the repair loop. The paper still treats these as raw benchmark signals until human precision/recall labeling and downstream build/runtime/user-acceptance checks are added.

Verification

npm run lint
npm run build
npm run knip
npm test
npm run paper:tables
make -C paper
Checked paper/main.log for undefined refs/citations and overfull/warning/error lines
Verified PR review thread is resolved and CI is green

Remaining Before Submission

Add final author affiliations/contact metadata
Label findings for precision and recall, or explicitly publish as an unlabeled pilot
Pair laint outcomes with typecheck/build/preview/runtime/user-acceptance labels
Decide whether to keep this as arXiv-only or target a workshop format too

danielchen0 added 18 commits May 16, 2026 18:00

docs: draft agent-oriented linting paper

b8b6e69

docs: format paper readme

3111272

docs: include mobile app framing in paper

da9695f

docs: describe prompt-to-code eval

4624ac9

docs: add prompt grid eval harness

286679d

docs: add preliminary eval numbers

70afa7b

docs: frame laint as llm benchmark

1be8c14

docs: clarify laint benchmark framing

fba83fd

docs: describe benchmark behavioral signals

d3146e5

docs: add Arnav Surve as paper author

759ddfd

docs: rename paper validity section to limitations

5be1551

docs: clarify local heuristic tradeoff

c72b770

docs: clarify paper terminology

2504637

docs: add recall to detector metrics

22fed69

docs: add f-score detector metric

e162974

docs: pin paper benchmark version

c74d8f0

docs: clarify rule category source

f9a403f

docs: make paper numbers reproducible

fb7af44

arnavsurve reviewed May 17, 2026

View reviewed changes

Comment thread paper/main.tex

danielchen0 added 7 commits May 16, 2026 22:34

docs: archive full prompt grid artifact

182c08f

docs: highlight edit-time repair loop

0c5d616

docs: add expanded grid data to paper

afa6a41

docs: add generated result tables to paper

77b4992

docs: add repair loop pilot to paper

d0ca77f

docs: frame repair loop as diagnostic compliance

e7da143

docs: tighten benchmark pilot framing

5e7eae4