Showing posts with label Stanford. Show all posts
Showing posts with label Stanford. Show all posts

Tuesday, December 16, 2025

Security and technology: Commercial first, production requirements from day one, and programs of capability

The Stanford DEFCON Technology and National Security Student Network organized and hosted a stellar all day event about all things defense and technology with attendees from the whole ecosystem: Government representatives, primes, neoprimes, startups, students, professors, professionals, and venture capitalists and angel investors.




My three favorite quotes of day (without attribution - Chatham House rules 😎 )


👉 The best way to develop is off the government system and go commercial first


✊ Your production requirements need to be built on day one and in conjunction with product requirements - otherwise the cost of change is just to high


👍 The military needs to move to programs of capability and warfare as a service


It used to be that defense tech startups had to be funded by SBIRs and the like, but the massive technology disruptions have given rise to a new dawn that is venture financed.



A version of this post was first published on LinkedIn in November 2025

Monday, December 15, 2025

Unresolved Software Challenges in Robotics in 2025

A previous post laid out the three key themes at this year's Bay Area Robotics Symposium (BARS) at Stanford.

In this post I will describe several open fundamental issues and some of the critical software challenges which were highlighted and remain unresolved:




𝟭. 𝗣𝗼𝗹𝗶𝗰𝘆 𝗥𝗲𝗹𝗶𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗮𝗻𝗱 𝗦𝗮𝗳𝗲𝘁𝘆

Robotics still lacks robust mechanisms for real-time failure detection, safety guarantees under learned policies, and predictable behavior in out-of-distribution conditions


𝟮. 𝗘𝗻𝘃𝗶𝗿𝗼𝗻𝗺𝗲𝗻𝘁 𝗥𝘂𝗻𝘁𝗶𝗺𝗲 𝗕𝗼𝘁𝘁𝗹𝗲𝗻𝗲𝗰𝗸𝘀

Model inference is improving rapidly; simulation performance is not. Python-based environments remain a major constraint, and high-speed C++/GPU simulators are still nascent.


𝟯. 𝗧𝗵𝗲 𝗛𝘂𝗺𝗮𝗻–𝗥𝗼𝗯𝗼𝘁 𝗘𝗺𝗯𝗼𝗱𝗶𝗺𝗲𝗻𝘁 𝗚𝗮𝗽

Significant progress has been made, but mapping human intent onto robot morphology continues to be a major open challenge—especially for contact-rich or bimanual manipulation.


𝟰. 𝗟𝗼𝗻𝗴-𝗛𝗼𝗿𝗶𝘇𝗼𝗻, 𝗠𝘂𝗹𝘁𝗶-𝗦𝘁𝗲𝗽 𝗘𝘅𝗲𝗰𝘂𝘁𝗶𝗼𝗻

Memory remains a limiting factor. Retrieval-based methods represent progress, but long-sequence stability is unresolved for most architectures.


𝟱. 𝗨𝘀𝗮𝗯𝗶𝗹𝗶𝘁𝘆 𝗼𝗳 𝗥𝗼𝗯𝗼𝘁𝗶𝗰𝘀 𝗦𝗼𝗳𝘁𝘄𝗮𝗿𝗲

The ecosystem remains fragmented. Despite numerous advancements, there is no unified, developer-friendly stack equivalent to “the PyTorch of robotics.”


𝘛𝘩𝘦 𝘣𝘰𝘵𝘵𝘰𝘮 𝘭𝘪𝘯𝘦

Robotics is entering a period of accelerated capability—but progress is constrained less by hardware and more by software infrastructure, data engineering, and simulation bottlenecks.

👉 The largest opportunity now is to build the scalable, reliable software layer that bridges today’s innovations with real-world deployment at scale. 



This post was first published on LinkedIn in November 2025.

Three Themes in Robotics Research in 2025

This year’s Bay Area Robotics Symposium (BARS) brought together leading researchers and students from the University of California, Berkeley, the University of California, Davis, the University of California, Santa Cruz and Stanford University. The short talks touched upon many aspects of robotics across models, learning methods, scenarios and embodiments.

My observations will solely focus on the software aspects of robotics. It is important to remember that academic research of course is not covering all aspects of robotics use in the real world. And amongst the topics, it is not always obvious at the moment in time which technologies will be game changing, which will remain features, and which will disappear into oblivion.

What is clear is the unrelenting speed of developments. The term Vision Language Model (VLM) emerged and gained prominence in 2022, and Vision Language Action (VLA) was coined more recently by Google DeepMind in July 2023. If, like Sleeping Beauty, the roboticist had gone to sleep in 2020 and just woken up in time for the conference she would not recognize today’s world and indeed consider it a fairy tale.




𝘛𝘩𝘳𝘦𝘦 𝘵𝘩𝘦𝘮𝘦𝘴 𝘴𝘵𝘰𝘰𝘥 𝘰𝘶𝘵 𝘢𝘤𝘳𝘰𝘴𝘴 𝘳𝘦𝘴𝘦𝘢𝘳𝘤𝘩 𝘨𝘳𝘰𝘶𝘱𝘴

1️⃣ 𝗩𝗶𝘀𝗶𝗼𝗻 𝗟𝗮𝗻𝗴𝘂𝗮𝗴𝗲 𝗔𝗰𝘁𝗶𝗼𝗻 𝗠𝗼𝗱𝗲𝗹𝘀 𝗔𝗿𝗲 𝗕𝗲𝗰𝗼𝗺𝗶𝗻𝗴 𝘁𝗵𝗲 𝗡𝗲𝘄 𝗥𝗼𝗯𝗼𝘁𝗶𝗰𝘀 𝗦𝘁𝗮𝗰𝗸

Next-generation VLAs are now coordinating perception, reasoning, and control. These next-generation VLAs can plan, adjust, and correct themselves at inference time using techniques like test-time action sampling (RoboMonkey), memory retrieval (MemER), and affordance reasoning (LITEN).

Implication: Robots are shifting from fixed pipelines to inference-time intelligence.



2️⃣ 𝗦𝘆𝗻𝘁𝗵𝗲𝘁𝗶𝗰 𝗮𝗻𝗱 𝗧𝗿𝗮𝗻𝘀𝗳𝗼𝗿𝗺𝗲𝗱 𝗛𝘂𝗺𝗮𝗻 𝗗𝗮𝘁𝗮 𝗜𝘀 𝗥𝗲𝗽𝗹𝗮𝗰𝗶𝗻𝗴 𝗥𝗼𝗯𝗼𝘁 𝗧𝗲𝗹𝗲𝗼𝗽𝗲𝗿𝗮𝘁𝗶𝗼𝗻

A major shift is underway: instead of collecting thousands of robot demonstrations, researchers are turning human data into robot-friendly training material. This includes retargeted human motion (LeVerB), edited human videos (Masquerade), and fast, mocap-free data collection (TWIST2 or Real2Render2Real).

Implication: Robots no longer need endless teleoperation—they can learn from people, videos, and synthetic versions of both. The key constraint is no longer data collection, but data transformation and representation.



3️⃣ 𝗛𝘆𝗯𝗿𝗶𝗱 𝗦𝗶𝗺𝘂𝗹𝗮𝘁𝗶𝗼𝗻 𝗜𝘀 𝘁𝗵𝗲 𝗢𝗻𝗹𝘆 𝗣𝗮𝘁𝗵 𝗧𝗵𝗿𝗼𝘂𝗴𝗵 𝘁𝗵𝗲 𝗥𝗲𝗮𝗹𝗶𝘁𝘆 𝗚𝗮𝗽

Physics engines provide controllability; video models provide realism; world models provide long-horizon prediction. The emerging direction synthesizes all three.

Implication: Future training pipelines will depend on composite simulators, not a single dominant tool.




This post was first published on LinkedIn in November 2025.