Arbitrary File Write via Archive Extraction (Zip Slip) Affecting docling package, versions [,2.91.0)


Severity

Recommended
0.0
high
0
10

CVSS assessment by Snyk's Security Team. Learn more

Threat Intelligence

EPSS
0.12% (31st percentile)

Do your applications use this vulnerable package?

In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.

Test your applications
  • Snyk IDSNYK-PYTHON-DOCLING-17151751
  • published4 Jun 2026
  • disclosed3 Jun 2026
  • creditUnknown

Introduced: 3 Jun 2026

NewCVE-2026-44017  (opens in a new tab)
CWE-29  (opens in a new tab)

How to fix?

Upgrade docling to version 2.91.0 or higher.

Overview

docling is a SDK and CLI for parsing PDF, DOCX, HTML, and more, to a unified document representation for powering downstream workflows such as gen AI applications.

Affected versions of this package are vulnerable to Arbitrary File Write via Archive Extraction (Zip Slip) in easyocr_model.py process. An attacker who can control the model download source or intercept the download process can overwrite arbitrary files on the file system by supplying a malicious ZIP archive containing malicious member paths. This can lead to execution of arbitrary code, persistent backdoors, or system compromise.

Details

It is exploited using a specially crafted zip archive, that holds path traversal filenames. When exploited, a filename in a malicious archive is concatenated to the target extraction directory, which results in the final path ending up outside of the target folder. For instance, a zip may hold a file with a "../../file.exe" location and thus break out of the target folder. If an executable or a configuration file is overwritten with a file containing malicious code, the problem can turn into an arbitrary code execution issue quite easily.

The following is an example of a zip archive with one benign file and one malicious file. Extracting the malicous file will result in traversing out of the target folder, ending up in /root/.ssh/ overwriting the authorized_keys file:


+2018-04-15 22:04:29 ..... 19 19 good.txt

+2018-04-15 22:04:42 ..... 20 20 ../../../../../../root/.ssh/authorized_keys

CVSS Base Scores

version 4.0
version 3.1