User Need
People need AI that can help across messy real work: chats, terminals, files, voice notes, screenshots, and changing priorities.
Prototype Project Sanshar
A product/research prototype for interfaces where Claude-like systems observe real surfaces, help a person act, admit uncertainty, and leave proof.
Thesis
A helpful AI partner is not only a model response. It is a system that understands what the user is trying to do, which surface produced the signal, what evidence exists, what tool should be used, and whether the result actually reached the user. Sanshar is my independent prototype lab for testing those product questions.
People need AI that can help across messy real work: chats, terminals, files, voice notes, screenshots, and changing priorities.
Assistants can sound confident while missing context, skipping proof, overstepping privacy, or failing to surface the answer.
Sanshar models each prototype interaction as source, decision, action, verifier, and read-back instead of an unstructured chat turn.
Help the user think and act better without requiring them to micromanage every step.
Prototype Map
The prototype is designed so an assistant does not jump from input to answer. Each meaningful event passes through evidence, decision, action, verification, and read-back.
Whether an AI assistant can stay useful across messy surfaces without overclaiming what it saw or did.
Latency, missed events, false confidence, tool choice, surface delivery, user correction, and recovery quality.
Private surfaces, secrets, external mutations, durable learning, and broad capture stay behind explicit gates.
Interaction Loops
The core design pattern is to replace vague autonomy with small loops that can be observed, tested, corrected, and improved.
Capture the event, classify modality and urgency, create a source packet, and decide if the signal deserves attention.
Choose response language, tool, model, verifier, risk gate, and output surface using runtime dimensions instead of static modes.
Prefer local, reversible, bounded action; ask before private, costly, external, broad, or irreversible action.
Compare expected vs observed, verify delivery, record reason codes, and create a reticket when the system misses.
Design Decisions
The design work is not only visual. It is deciding what the system should do when context is stale, the user speaks mixed-language, a voice path is only partly wired, or a surface says "seen" but not "done."
The most important design decision is restraint. A better assistant is not the one that does the most automatically; it is the one that knows which step is safe, which step needs proof, and which step should become a clear question for the person.
Evals
For human-AI tools, evals should test the interface loop, not only the final answer. Sanshar uses scenarios that expose practical agent failures: missed events, stale context, false confidence, noisy channels, permission ambiguity, and output delivery gaps.
Did the answer reach the intended place, and can the system prove it with returned metadata?
Can the assistant route text, voice, screenshots, and files without pretending an unwired path works?
Did the system act locally when safe and ask when the next step required user consent?
When something failed, did it create a useful reticket with reason code and next safe step?
Anthropic Fit
The work I want to do is to build and study Claude experiences that make people more capable: better at learning, debugging, deciding, creating, and coordinating. Sanshar is my proof that I already think in product loops, not only backend architecture.