Deserialization of Untrusted Data Affecting pyarrow package, versions [0.14.0,14.0.1)


Severity

Recommended
0.0
critical
0
10

CVSS assessment made by Snyk's Security Team

    Threat Intelligence

    EPSS
    0.52% (78th percentile)

Do your applications use this vulnerable package?

In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.

Test your applications
  • Snyk ID SNYK-PYTHON-PYARROW-6052811
  • published 9 Nov 2023
  • disclosed 9 Nov 2023
  • credit Unknown

How to fix?

Upgrade pyarrow to version 14.0.1 or higher.

Overview

pyarrow is a Python API for functionality provided by the Arrow C++ libraries, along with tools for Arrow integration and interoperability with pandas, NumPy, and other software in the Python ecosystem.

Affected versions of this package are vulnerable to Deserialization of Untrusted Data via the IPC and Parquet readers. An attacker can execute arbitrary code by supplying malicious files in IPC, Feather or Parquet formats.

Workaround

If upgrading is not possible, the package pyarrow-hotfix has been released that disables the vulnerability on older versions. This package should be included alongside the pyarrow dependency.

After adding the pyarrow-hotfix package to the project directory dependencies, users can ignore this issue given that the hotfix has been applied and the vulnerable code is no longer used.

Details

Serialization is a process of converting an object into a sequence of bytes which can be persisted to a disk or database or can be sent through streams. The reverse process of creating object from sequence of bytes is called deserialization. Serialization is commonly used for communication (sharing objects between multiple hosts) and persistence (store the object state in a file or a database). It is an integral part of popular protocols like Remote Method Invocation (RMI), Java Management Extension (JMX), Java Messaging System (JMS), Action Message Format (AMF), Java Server Faces (JSF) ViewState, etc.

Deserialization of untrusted data (CWE-502) is when the application deserializes untrusted data without sufficiently verifying that the resulting data will be valid, thus allowing the attacker to control the state or the flow of the execution.

CVSS Scores

version 3.1
Expand this section

Snyk

9.8 critical
  • Attack Vector (AV)
    Network
  • Attack Complexity (AC)
    Low
  • Privileges Required (PR)
    None
  • User Interaction (UI)
    None
  • Scope (S)
    Unchanged
  • Confidentiality (C)
    High
  • Integrity (I)
    High
  • Availability (A)
    High
Expand this section

NVD

9.8 critical