Home / Architects of Autonomy / Liang Wenfeng
Architects of Autonomy

Liang Wenfeng

Founder & CEO, DeepSeek
Living Document Last updated: 7 May 2026
Listen to this portrait (TTS audio)

In January 2025, a Chinese AI startup released a model that matched or exceeded the performance of the most expensive Western foundation models — at a fraction of the training cost. The company was DeepSeek. Its founder was Liang Wenfeng, a former quantitative hedge fund manager with no prior career in AI research. Within weeks, $1 trillion in market capitalisation evaporated from US technology stocks. The message was not subtle: the assumption that autonomous AI requires unlimited capital is wrong.

From Quantitative Finance to Foundation Models

Liang Wenfeng co-founded High-Flyer, a quantitative hedge fund based in Hangzhou, in 2015. The fund was built on machine learning from its inception — using neural networks for market prediction, portfolio construction, and risk management at a time when most Chinese funds relied on traditional quantitative methods. By the early 2020s, High-Flyer had accumulated significant GPU infrastructure for its trading operations, and Wenfeng had developed a deep practical understanding of large-scale computation.

In 2023, Wenfeng founded DeepSeek as a research laboratory dedicated to artificial general intelligence. The timing was deliberate: the release of ChatGPT had triggered a global race for foundation model capability, and Wenfeng recognised that the computational infrastructure he had built for trading could be repurposed for AI research. Where Western labs were raising billions in venture capital to purchase new GPU clusters, Wenfeng already had the hardware.

The Efficiency Thesis

DeepSeek's contribution to the Zero Human Company thesis is fundamentally about efficiency. The DeepSeek-V3 model, released in late 2024, achieved performance comparable to GPT-4 and Claude 3.5 on major benchmarks while reportedly costing approximately $5.6 million to train — a figure that is roughly 1/20th of the estimated training cost for comparable Western models. DeepSeek-R1, a reasoning model released in January 2025, demonstrated chain-of-thought capabilities rivalling OpenAI's o1 at similarly reduced cost.

The architectural innovations that enabled this are technically significant. DeepSeek pioneered Mixture-of-Experts (MoE) architectures at scale, where only a fraction of the model's total parameters are activated for any given input. This dramatically reduces both training and inference costs. The team also developed Multi-head Latent Attention (MLA), a technique that compresses key-value caches during inference, reducing memory requirements without sacrificing quality. These are not incremental improvements; they represent a fundamentally different approach to the cost curve of AI capability.

The Geopolitical Dimension

DeepSeek's emergence is inseparable from the geopolitical context of US-China technology competition. The Biden administration's October 2022 export controls restricted China's access to the most advanced AI chips — specifically NVIDIA's A100 and H100 GPUs. DeepSeek was built under these constraints, reportedly using NVIDIA A100 chips acquired before the ban took effect, and engineering around the limitations through software efficiency rather than hardware brute force.

For boardrooms, this carries a profound strategic implication: hardware advantage is necessary but not sufficient, and software efficiency can partially compensate for hardware restrictions. The assumption that whoever has the most GPUs wins the AI race has been materially challenged. Enterprise leaders planning autonomous operations cannot assume that the most expensive solution is necessarily the most capable one.

Open Source as Strategy

In a move that distinguished DeepSeek from most major AI labs, Wenfeng released DeepSeek's models as open source under permissive licenses. The strategic logic is multifaceted: open-sourcing accelerates adoption, attracts global research talent, establishes DeepSeek's architecture as a reference implementation, and — critically — commoditises the very capability that Western labs charge premium prices for.

The ZHC implication is significant. If foundation model capability becomes an open commodity rather than a proprietary service, the economics of deploying autonomous AI operations shifts fundamentally. Enterprises can run capable AI models on their own infrastructure at marginal cost, rather than paying per-token fees to a handful of providers. Wenfeng's open-source strategy, whether philosophically motivated or strategically calculated, accelerates the timeline for cost-effective autonomous operations across every industry.

What Boards Should Watch

Three developments from DeepSeek merit sustained boardroom attention. First, the cost trajectory: if DeepSeek continues to demonstrate that capable AI models can be trained for single-digit millions rather than hundreds of millions, the barrier to entry for enterprise AI deployment collapses. Second, the inference efficiency: DeepSeek's MoE and MLA innovations reduce the running cost of AI systems in production, which is where enterprise economics are actually decided. Third, the talent pipeline: DeepSeek has demonstrated that world-class AI research can emerge from a relatively small team (~200 researchers) with focused engineering discipline, challenging the assumption that only billion-dollar labs produce frontier capability.

The Architect's Legacy

Liang Wenfeng's contribution to the Zero Human Company thesis is not about building autonomous systems directly — it is about demolishing the cost assumptions that prevented their deployment. If the Western AI lab consensus held that frontier capability requires $100 million training runs, $10 billion in annual compute spend, and armies of PhD researchers, DeepSeek has demonstrated that the same capability can be achieved with an order of magnitude less capital.

For enterprise leaders, this is arguably the most strategically important development in AI since the release of ChatGPT. The question is no longer "can we afford to deploy autonomous AI?" — it is "can we afford not to, when our competitors can deploy equivalent capability at 5% of the cost we assumed?"

Wenfeng represents a new archetype in the AI landscape: the efficiency-first builder who proves that constraints — financial, regulatory, geopolitical — are engineering problems, not existential barriers. The question he poses to every boardroom: if world-class AI capability is now available at commodity prices, what is your excuse for not deploying it?

Further Reading

📩 Free Daily Intelligence

Stay ahead of the builders

Choose what lands in your inbox — boardroom-grade AI analysis, all free.

✓ You're subscribed. Check your inbox.

You're subscribed.

← Satya Nadella All Portraits