
Show HN: Simplex: Automate browser workflows using code and natural language

https://www.simplex.sh/playground
Would you mind sharing the story behind your pivot from on-demand photorealistic vision datasets[0] to browser automation?

[0] https://www.ycombinator.com/launches/Lbx-simplex-on-demand-p...

Apparently a simplex is a concept in geometry, and I'm guessing that is the intended meaning of the name. When most reasonably well-informed native speakers of English hear "simplex," though, they are likely to think of "herpes simplex" and so to wonder, as I did, why the program is named after a sexually transmitted disease.
Great demo, straight to the point. It might be nice to have some kind of feedback mechanism if it can't find the element on the page, or if it's partially cut off. For example, I changed the GitHub profile to my own (https://github.com/spro) in the example and it doesn't scroll down far enough for the whole image. I imagine in general it would be nice to scroll to an element ID (or even element description using the vision models) instead of a hardcoded value.

Side note: the comment for the frequency graph is wrong; it mentions stars instead.

I tried:

  simplex.goto("www.amazon.com")
  simplex.wait(2000)
  simplex.click('the search bar')
  simplex.type("bicycle")
  simplex.press_enter()
  simplex.wait(1000)
  image = simplex.extract_image("first 3 rows of results")
and got this error:

  simplex error processing error
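
A generic error like this does not say which call failed. A minimal sketch of one way to narrow it down, assuming nothing about Simplex beyond the calls shown above: run each step through a small wrapper that names the failing step. `run_steps` and `StepError` are hypothetical helpers, not part of Simplex.

```python
from typing import Callable, List, Tuple

# A step is a human-readable name plus a zero-argument callable.
Step = Tuple[str, Callable[[], object]]


class StepError(RuntimeError):
    """Raised when a named step fails; the original exception is kept as the cause."""


def run_steps(steps: List[Step]) -> List[object]:
    """Run steps in order and return their results.

    Stops at the first failure and reports the step's position and name.
    """
    results = []
    for index, (name, action) in enumerate(steps, start=1):
        try:
            results.append(action())
        except Exception as exc:
            raise StepError(f"step {index} ({name}) failed: {exc}") from exc
    return results


# Hypothetical usage with the calls from the script above:
# run_steps([
#     ("goto", lambda: simplex.goto("www.amazon.com")),
#     ("click search bar", lambda: simplex.click('the search bar')),
#     ("type query", lambda: simplex.type("bicycle")),
#     ("press enter", lambda: simplex.press_enter()),
#     ("extract image", lambda: simplex.extract_image("first 3 rows of results")),
# ])
```

The wrapper only adds context to whatever exception the underlying call raises, so it cannot explain the failure by itself, but it does show which step to look at.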
Looks pretty cool. How do you distinguish Simplex from Skyvern or UI.Vision?
Very cool. Any chance of either open-sourcing it or allowing the browser part to be self-hosted, i.e. to act on websites hosted on a local LAN/VPN?

Also, did you evaluate https://github.com/browser-use/browser-use by any chance, and do you have any comments about it? I'm assuming it was too AI-heavy, based on what you said about Claude etc.?

VLMs are great; I have been able to use one for a similar project too [1], and they're only going to get better. Congratulations on the product launch. What's your VLM for this?

[1] A framework to use/control mobile phones via any LLM - https://github.com/BandarLabs/clickclickclick

I've tried the following code:

    simplex.goto("github.com/mitchellh")
    num_contribs = simplex.extract_text("blabla")
    print(num_contribs)
It outputs all the text from the page. Is that expected? Maybe it should fail, indicating that the element could not be found?
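
Until the library itself reports a missing element, a caller-side guard is one possible workaround. This is a minimal sketch: `checked_extract` and `ElementNotFound` are hypothetical names, and how you obtain the full page text is left open, since nothing in this thread shows a Simplex call for it.

```python
class ElementNotFound(LookupError):
    """Raised when an extraction result does not look like a specific element."""


def _normalize(text: str) -> str:
    # Collapse runs of whitespace so formatting differences do not hide a match.
    return " ".join(text.split())


def checked_extract(extracted: str, page_text: str) -> str:
    """Return `extracted` unless it is empty or identical to the whole page's text."""
    got = _normalize(extracted)
    if not got:
        raise ElementNotFound("extraction returned no text")
    if got == _normalize(page_text):
        raise ElementNotFound("extraction returned the entire page text")
    return extracted


# Hypothetical usage, with `page_text` obtained by whatever means is available:
# num_contribs = checked_extract(simplex.extract_text("blabla"), page_text)
```

The check is a heuristic: it catches the "whole page came back" case described above, but not a wrong element that returns a plausible fragment.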
Nice job! Does this only work with Python? In your docs, under API reference, I only see one endpoint, '/find-element'. If I wanted to use this via the REST API without Python, is that possible? Also, what kind of pricing can be expected?
I have to say, this looks both simple and 20 times better than those horrible no-code solutions with boxes and arrows.

I think you are onto something here.

Nice.

Does it work with sites that have Cloudflare anti-bot (or similar) protection?

Can this use a cloud browser API like Browserless?
It fails for this query: search("amazon.in", "fitness watch")