RL is the answer to agentic flow: OpenAI's recent deep search
shows this. For
developers, product dev will be less about
details of implementation but more
on high-level alignment
on result.
The future product dev will be the following: developer /
user specifies what he
wants, agent spends available compute
to execute this and deliver result. The
result is a product
much more agile and flexible. User can achieve much more
with the product than the developer even intended or expected.
Today's product dev and agentic flow still focuses on controllability and
cost reduction.
But over time, the cost of compute will drop,
and because developer's time is more
expensive, development
will increasingly rely on smart agents to figure out
action.