alt.hn

4/7/2026 at 2:33:48 PM

Show HN: Finalrun – Spec-driven testing using English and vision for mobile apps

https://github.com/final-run/finalrun-agent

by ashish004

4/7/2026 at 5:45:30 PM

This solves a massive headache. The drift between externally generated tests and an active codebase is a brutal problem to maintain.

Using vision-based execution instead of brittle XPaths is a great baseline, but moving the test definitions to live directly alongside the repo context is definitely the real win here.

Did you find that generating the YAML from the codebase context entirely eliminated the "stale test" issue, or do developers still need to manually tweak the generated YAML when mobile UI layouts change drastically? Great project!

by avikaa

4/7/2026 at 5:53:17 PM

Hi Avikaa, finalrun provides skills that you can integrate with any IDE of your choice. You can just ask the finalrun-generate-test skill to update all the test for your new feature.

by ashish004

4/7/2026 at 6:21:20 PM

  > The shift for me was realizing test generation shouldn’t be a one-off step. Tests need to live alongside the codebase so they stay in sync and have more context.

Does the actual test code generated by the agent get persisted to project?

If not, you have kicked the proverbial can down the road.

by gavinray

4/7/2026 at 6:37:09 PM

Yes gavinray, It gets persisted to the project. Its lives alongside the codebase. So that any test generated has the best context of what is being shipped. which makes the AI models use the best context to test any feature more accurately and consistently.

by ashish004

4/7/2026 at 10:59:46 PM

Just updated README.md, it's lot simpler and addresses on the core. Thanks for the feedback, please checkout

by ashish004

4/8/2026 at 5:02:06 AM

Verification of AI generated code right would be dope.

We do something similar in our company for web with playwright but facing a lot of flaky tests.

Will check this out

by usual_engineer

4/8/2026 at 5:07:44 AM

Hey, thats true, verification of ai generated code needs proof with video of each action, console logs and network logs. Would love to know how you are solving for web. would be a great learning for me too.

by ashish004

4/8/2026 at 1:36:14 AM

I just ran my first test. Thanks team :)

by srinidhigs829

4/8/2026 at 2:36:59 AM

Do share us feedback.

by ashish004

4/7/2026 at 2:57:17 PM

Looks pretty cool. How does your agent understand plain english?

by arnold_laishram

4/7/2026 at 3:06:34 PM

We have built a QA agent that can understand your plain english intent and uses vision to reason and navigate the app to test your intent. You can check our benchmark here https://finalrun.app/benchmark/ and how we architected our agent for the benchmark https://github.com/final-run/finalrun-android-world-benchmar.... Its all open source

by ashish004

4/7/2026 at 5:01:26 PM

Agentic testing. Kudos to your decision to open-source it!

by sahilahuja

4/7/2026 at 5:45:17 PM

[dead]

by ashish004

4/7/2026 at 5:59:54 PM

[dead]

by rootally7

4/7/2026 at 3:19:45 PM

[dead]

by arbaaz