The First Fully General Computer Action Model

Hacker News

February 23, 2026

AI-Generated Deep Dive Summary

The article introduces FDM-1, a foundation model designed for general computer use. Unlike previous models, which relied on vision-language approaches and limited datasets, FDM-1 is trained directly on video from an 11-million-hour screen recording dataset. According to the article, this lets it perform complex tasks such as CAD modeling, financial analysis, engineering automation, and even driving a car in real time at 30 frames per second (FPS). The model can process long contexts (minutes or hours of video), which the article presents as a significant advance for AI computer interaction. It does so with a video encoder that compresses nearly two hours of video into one million tokens, which the article describes as 50 times more efficient than previous state-of-the-art models and 100 times more efficient than OpenAI’s encoder. This efficiency allows the model to scale while maintaining high performance.
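To put the encoder figures in perspective, here is a rough back-of-envelope calculation in Python. It is a sketch under stated assumptions, not the article's own arithmetic: it rounds "nearly two hours" up to exactly two hours, assumes the footage is sampled at the same 30 FPS quoted for real-time operation, and reads "50 times" and "100 times" more efficient as 50x and 100x fewer tokens per frame. All function and constant names are hypothetical.

```python
# Back-of-envelope token budget for the quoted encoder figures.
# Assumptions (not stated in the source): exactly 2 hours of video,
# sampled at 30 FPS, and "N times more efficient" meaning N times
# fewer tokens per frame.

VIDEO_HOURS = 2.0          # "nearly two hours", rounded up
FPS = 30                   # assumed sampling rate
TOKEN_BUDGET = 1_000_000   # context size quoted in the summary


def frame_count(hours: float, fps: int) -> float:
    """Number of frames in `hours` of video at `fps`."""
    return hours * 3600 * fps


def tokens_per_frame(hours: float, fps: int, budget: int) -> float:
    """Average tokens spent per frame if `hours` of video fill `budget`."""
    return budget / frame_count(hours, fps)


def seconds_covered(tok_per_frame: float, fps: int, budget: int) -> float:
    """Seconds of video that fit in `budget` at a given per-frame cost."""
    return budget / (tok_per_frame * fps)


if __name__ == "__main__":
    tpf = tokens_per_frame(VIDEO_HOURS, FPS, TOKEN_BUDGET)
    print(f"frames in the clip: {frame_count(VIDEO_HOURS, FPS):.0f}")
    print(f"tokens per frame:   {tpf:.2f}")
    # Hypothetical baselines that cost 50x and 100x more tokens per frame.
    for factor in (50, 100):
        secs = seconds_covered(tpf * factor, FPS, TOKEN_BUDGET)
        print(f"{factor}x costlier encoder covers about {secs:.0f} s")
```

Under these assumptions the encoder spends roughly 4.6 tokens per frame. An encoder 50 times costlier would fit only about two and a half minutes of video into the same one-million-token context, which is why the compression ratio matters for long-horizon tasks.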

Originally published on Hacker News on 2/23/2026