Добавить новость




Новости сегодня

Новости от TheMoneytizer

DeepSeek Boosts OCR Performance With Alibaba Open-Source AI

eWeek 

Chinese AI company DeepSeek has dropped some intriguing news in the document processing world.

The firm has unveiled DeepSeek-OCR 2, a completely revamped optical character recognition system that replaces its previous architecture with Alibaba’s Qwen AI technology.

Tech swap

DeepSeek has ditched OpenAI’s CLIP framework that powered its original system and swapped it for Alibaba Cloud’s lightweight Qwen2-0.5b model.

This new approach delivers a 3.7% performance boost over the previous version, which might sound modest until you consider what’s happening under the hood. DeepSeek-OCR 2 now processes documents the way humans actually read them—dynamically rearranging and understanding content based on context and meaning rather than rigid, mechanical scanning.

Efficiency breakthrough

Built on DeepSeek‘s proprietary DeepEncoder V2 architecture, this system can compress complex document pages into just 256 to 1,120 visual tokens. Compare that to traditional systems that often require thousands of tokens for similar processing.

Testing on OmniDocBench v1.5 revealed the system’s 91.09% overall score—a significant jump that translates to real-world performance gains in reading order recognition and layout understanding.

DeepSeek claims the system cuts computational costs for downstream language models while maintaining accuracy that rivals human-level document comprehension.

The Alibaba collaboration

This collaboration highlights something bigger than just technical improvements. The update showcases China‘s growing open-source AI ecosystem, where companies are building on each other’s innovations to create increasingly powerful tools.

Consider the rapid iteration cycle—this upgrade comes just over three months after DeepSeek launched its first OCR system.

Face forward

DeepSeek has open-sourced the entire system on Hugging Face, meaning developers worldwide can access and build upon this technology. The implications are enormous for industries that handle massive document volumes—from legal firms processing contracts to healthcare organizations digitizing patient records.

The semantic reasoning approach enables scanning patterns that adapt to different document types automatically. Instead of rigid, line-by-line processing, this development reorganizes visual information based on relevance in each specific document.

For businesses drowning in paperwork, this could represent a shift from expensive, time-consuming document processing to automation that actually understands what it’s reading.

A documentary premiering at Sundance this week reckons it reveals the disturbing reality behind AI’s rapid rise.

The post DeepSeek Boosts OCR Performance With Alibaba Open-Source AI appeared first on eWEEK.

Читайте на сайте


Smi24.net — ежеминутные новости с ежедневным архивом. Только у нас — все главные новости дня без политической цензуры. Абсолютно все точки зрения, трезвая аналитика, цивилизованные споры и обсуждения без взаимных обвинений и оскорблений. Помните, что не у всех точка зрения совпадает с Вашей. Уважайте мнение других, даже если Вы отстаиваете свой взгляд и свою позицию. Мы не навязываем Вам своё видение, мы даём Вам срез событий дня без цензуры и без купюр. Новости, какие они есть —онлайн с поминутным архивом по всем городам и регионам России, Украины, Белоруссии и Абхазии. Smi24.net — живые новости в живом эфире! Быстрый поиск от Smi24.net — это не только возможность первым узнать, но и преимущество сообщить срочные новости мгновенно на любом языке мира и быть услышанным тут же. В любую минуту Вы можете добавить свою новость - здесь.




Новости от наших партнёров в Вашем городе

Ria.city
Музыкальные новости
Новости России
Экология в России и мире
Спорт в России и мире
Moscow.media










Топ новостей на этот час

Rss.plus