What if we just prompted user interfaces and streamed them in as video? No DOM, no browser: just a prompt describing what you want, and a UI rendered in real time by a Flux model. This uses snapdom to convert the DOM to a PNG, then transforms that image with style prompts. It works surprisingly well: clickable inputs, checkboxes, form submissions and all.
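
A rough sketch of how such a loop might be wired, for anyone curious. Everything here is an assumption rather than the actual implementation: `Capture` stands in for a DOM-to-PNG step such as snapdom, `Stylize` stands in for a call to an image model such as Flux, and `mapClickToSource` guesses that clicks on the styled frame are scaled back onto the underlying UI. All names are hypothetical.

```typescript
// Hypothetical sketch: every name below is illustrative, not from the original post.

interface Size { width: number; height: number; }
interface Point { x: number; y: number; }
interface Frame extends Size { png: Uint8Array; }

// Assumed to wrap a DOM-to-PNG capture (for example snapdom) of a hidden source UI.
type Capture = () => Promise<Frame>;
// Assumed to wrap an image-to-image model call (for example a Flux endpoint).
type Stylize = (frame: Frame, stylePrompt: string) => Promise<Frame>;

// Produce one styled frame: capture the source UI, then restyle it with the prompt.
async function renderStyledFrame(
  capture: Capture,
  stylize: Stylize,
  stylePrompt: string,
): Promise<Frame> {
  const source = await capture();
  return stylize(source, stylePrompt);
}

// Translate a click on the displayed frame into source-UI coordinates,
// assuming the styled frame keeps the source layout and is only scaled.
function mapClickToSource(click: Point, displayed: Size, source: Size): Point {
  if (displayed.width <= 0 || displayed.height <= 0) {
    throw new RangeError("displayed size must be positive");
  }
  return {
    x: (click.x * source.width) / displayed.width,
    y: (click.y * source.height) / displayed.height,
  };
}
```

In a browser, the mapped point could then be handed to something like `document.elementFromPoint` to find the real input or checkbox to activate, which would explain why form controls stay clickable even though the user only sees generated frames.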
