Arbitrary File Write via Archive Extraction (Zip Slip) in nltk | CVE-2019-14751

Q: How to fix?

Upgrade nltk to version 3.4.5 or higher.
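As a rough way to check whether an installed version falls in the affected range, the comparison can be sketched as below. The helper names `parse_release` and `is_vulnerable` are hypothetical, and the parsing is a simplification: it reads only the leading numeric components and ignores pre-release suffixes such as `rc1`.

```python
import re

# First fixed release according to this advisory.
FIXED_VERSION = (3, 4, 5)


def parse_release(version):
    """Hypothetical helper: '3.4.5' -> (3, 4, 5). Ignores suffixes like 'rc1'."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        raise ValueError("unrecognized version string: %r" % version)
    return tuple(int(part) for part in match.group().split("."))


def is_vulnerable(version):
    """Return True if the version is lower than the first fixed release."""
    release = parse_release(version)
    # Pad short versions such as '3.4' with zeros so tuples compare evenly.
    padded = release + (0,) * (len(FIXED_VERSION) - len(release))
    return padded < FIXED_VERSION
```

The installed version string can be obtained with `importlib.metadata.version("nltk")` on Python 3.8 or later and passed to `is_vulnerable`.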

Threat Intelligence

Proof of Concept

3.22% (87th percentile)

Snyk ID: SNYK-PYTHON-NLTK-460224
Published: 22 Aug 2019
Disclosed: 22 Aug 2019
Credit: Mike Salvatore

Introduced: 22 Aug 2019

CVE-2019-14751 CWE-29

Overview

nltk (Natural Language Toolkit) is a Python package for natural language processing.

Affected versions of this package are vulnerable to Arbitrary File Write via Archive Extraction (Zip Slip). It allows attackers to write arbitrary files via a ../ (dot dot slash) in an NLTK package (ZIP archive) that is mishandled during extraction.

Details

The vulnerability is exploited with a specially crafted zip archive that holds filenames containing path traversal sequences. During extraction, each filename in the malicious archive is concatenated to the target extraction directory, so a name containing ../ resolves to a final path outside the target folder. For instance, a zip may hold a file with a "../../file.exe" location and thus break out of the target folder. If an executable or a configuration file is overwritten with a file containing malicious code, the problem can escalate to arbitrary code execution.
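The concatenation problem described above, and one common way to guard against it, can be sketched as follows. This is a minimal illustration, not the patch applied in nltk: the function name `resolve_member_path` is hypothetical, and the approach (resolve the destination, then confirm it still lies under the target directory) is one of several possible checks.

```python
import os


def resolve_member_path(target_dir, member_name):
    """Hypothetical helper: return the destination path for an archive member,
    or raise ValueError if that path would fall outside target_dir."""
    base = os.path.realpath(target_dir)
    # Naive concatenation is the vulnerable step: '../' segments in
    # member_name can climb out of base once the path is resolved.
    dest = os.path.realpath(os.path.join(base, member_name))
    # Accept the destination only if it is still located under base.
    if os.path.commonpath([base, dest]) != base:
        raise ValueError("blocked path traversal: %r" % member_name)
    return dest
```

Calling `resolve_member_path("/tmp/extract", "../../file.exe")` raises `ValueError`, while a plain name such as `good.txt` resolves to a path inside the target directory.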

The following is an example of a zip archive with one benign file and one malicious file. Extracting the malicious file traverses out of the target folder and ends up in /root/.ssh/, overwriting the authorized_keys file:

+2018-04-15 22:04:29 ..... 19 19 good.txt

+2018-04-15 22:04:42 ..... 20 20 ../../../../../../root/.ssh/authorized_keys
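An archive with the same two entry names can be built in memory and inspected before anything is written to disk. The sketch below uses the standard `zipfile` module; the function name `unsafe_members` and the file contents are placeholders chosen for illustration, and the check covers only `../` sequences and absolute POSIX paths, not Windows drive prefixes.

```python
import io
import posixpath
import zipfile


def unsafe_members(archive):
    """Hypothetical helper: list entry names that would escape the target folder."""
    flagged = []
    for name in archive.namelist():
        # Normalize separators and collapse 'a/../b' style segments.
        normalized = posixpath.normpath(name.replace("\\", "/"))
        if (normalized.startswith("/")
                or normalized == ".."
                or normalized.startswith("../")):
            flagged.append(name)
    return flagged


# Build an in-memory archive mirroring the listing above (placeholder contents).
buffer = io.BytesIO()
with zipfile.ZipFile(buffer, "w") as zf:
    zf.writestr("good.txt", "benign placeholder\n")
    zf.writestr("../../../../../../root/.ssh/authorized_keys", "malicious placeholder\n")

# Inspect the entry names without extracting anything.
with zipfile.ZipFile(buffer) as zf:
    flagged = unsafe_members(zf)
```

Here `flagged` contains only the `authorized_keys` entry, which an extractor could then skip or reject.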

CVSS Base Scores

version 3.1

Attack Vector (AV): Local
Attack Complexity (AC): Low
Privileges Required (PR): None
User Interaction (UI): Required
Scope (S): Unchanged
Confidentiality (C): None
Integrity (I): High
Availability (A): None

Arbitrary File Write via Archive Extraction (Zip Slip) Affecting nltk package, versions [,3.4.5)
