DeepSeek, a Chinese company, took the world by surprise Jan. 20 with the release of its multimodal large language models (LLMs) DeepSeek-R1 and DeepSeek-R1-Zero, providing performance on par with OpenAI's o1 model. These models leverage an open architecture with 671 billion total parameters. DeepSeek – solely funded by Chinese hedge fund High-Flyer and founded by Liang Wenfeng in Hangzhou, Zhejiang – has rapidly emerged as a major player in artificial intelligence (AI) research.
DeepSeek V3, released in December 2024, was a "standard" language model akin to OpenAI's GPT-4. In contrast, the recently launched R1 is a reasoning model, comparable to OpenAI's o1. China has been steadily advancing in AI research, producing high-quality models such as the Qwen family, developed by Alibaba. The intense competition in China has led to a price war, significantly reducing the cost of LLM hosting and fostering rapid innovation.
Geopolitical and technological implications
At least two significant geopolitical factors have driven China's accelerated AI innovation:
Restricted access to Western AI models: As of January 2025, OpenAI's GPT models and Meta's Llama models are either blocked or officially unavailable in China due to government regulations and provider decisions. This restriction has spurred Chinese companies to develop their own models.
Hardware constraints: U.S. export restrictions limit the availability of high-performance Nvidia Graphical Processing Units (GPUs) in China. GPUs are the primary computing hardware used to train models. Despite these constraints, DeepSeek reportedly trained its V3 model – comparable in performance to GPT-4 – on only 5% of the GPUs OpenAI used. This demonstrates efficiency that could enable more cost-effective model training and fine-tuning with fewer computational resources.
Open vs. closed models: A crucial distinction for insurance professionals
This distinction is particularly important for the insurance industry, where transparency, interpretability, and cost-efficiency are key factors in AI adoption.
Closed models
Closed LLMs – such as OpenAI's GPT models, Google's Gemini, Anthropic's Claude, and AWS's Titan – keep their training data, architecture, and weights proprietary. They are only accessible via APIs, meaning insurance companies cannot audit the inner workings of these models, raising concerns about data privacy, compliance, and explainability.
Open models
Open-source models – including Meta's Llama, Mistral, and DeepSeek's offerings – provide access to the trained weights, allowing companies to deploy and fine-tune models on their own infrastructure. This transparency enhances compliance, particularly in regulated industries such as insurance.
Applications in life insurance: Fine-tuning for cost efficiency
Example use cases
- Medical document analysis: Fine-tuning an open model on historical underwriting data can enhance accuracy in extracting conditions, treatments, and risk factors from unstructured medical records.
- Fraud detection: Training a model on anonymized claims data can help detect patterns indicative of fraudulent activity, reducing costs associated with false claims.
- Customer service automation: Deploying a customized chatbot to assist policyholders with policy inquiries and claims processes can improve response time and customer satisfaction. Such a bot would require considerable fine-tuning to match tone and the extreme need for accuracy.
With open models, life insurers can fine-tune AI to their specific underwriting needs while ensuring compliance with industry regulations such as Health Insurance Portability and Accountability Act (HIPPA) and the General Data Protection Regulation (GDPR).
Running models in a secure environment allows for full data control, unlike using third-party APIs, where data handling is dictated by external providers.
How will your data be used?
One of the most critical considerations for
Best practices for ensuring data privacy
- Deploy models in-house or on trusted cloud infrastructure: Avoid sending sensitive underwriting or claims data to third-party APIs whenever possible.
- Review legal agreements for API usage: Regardless of whether your model is open or closed, many API providers collect usage data for retraining and optimization. Ensure contracts specify that proprietary data will not be used for future model improvements.
- Implement access controls and auditing: Restrict access to AI-generated outputs to authorized personnel and maintain audit logs for compliance purposes.
The future of open AI in insurance
The rise of open AI models presents a transformative opportunity for life insurers. By embracing open-source LLMs, insurers can reduce costs, enhance accuracy, and maintain control over sensitive data. DeepSeek's advances illustrate that cutting-edge AI might no longer be confined to closed ecosystems and that open innovation is rapidly leveling the playing field.
As insurers navigate the evolving AI landscape, the strategic adoption of open models will empower them to harness the full potential of generative AI while safeguarding data privacy, improving operational efficiency, and driving business growth. The future of insurance AI is open, and forward-thinking companies will be the ones to capitalize on this shift.