Skip to content
Back to Blog

news · 9 min read

Bulletin — December 2025

Cl0p insists a year after Cleo. OpenAI repeats Shipmas. Anthropic closes the year with a Claude refresh. DORA and NIS2 enter the first inspection cycle. Year-end reports from Mandiant, CrowdStrike and Verizon. The year in one table.

· Manuel López Pérez · news

Cl0p insists a year after Cleo. OpenAI repeats Shipmas. Anthropic closes the year with a Claude refresh. DORA and NIS2 enter the first inspection cycle. Year-end reports from Mandiant, CrowdStrike and Verizon. The year in one table.

December 2025 closes a year in which the regulatory calendar moved from promise to procedure. Cl0p remains in MFT a year after Cleo. OpenAI repeats Shipmas. Anthropic refreshes Claude. Year-end reports confirming what the month-by-month was already saying. And the year summary in one table, without claiming exhaustive ranking.

Cl0p — first Cleo anniversary

December 2025. A year after the Cleo MFT zero-day, Cl0p’s extortion portal continues listing victims related to the late-2024 leak. Recent activity: exploitation of CVE-2025-31161 in CrushFTP since April (another MFT, same modus operandi) and Mandiant’s November deep-dive on the group’s industrial playbook. The pattern continues: enterprise MFT as preferred vector, opportunistic attack before disclosure, exfiltration to own S3, double extortion on the portal.

What’s operational: any managed file transfer on the corporate perimeter that hasn’t gone through a serious threat model during 2024-2025 is in queue for the next one. The lesson of MOVEit (June 2023), Cleo (December 2024) and CrushFTP (April 2025) doesn’t need more data to confirm itself.

Source: https://cloud.google.com/blog/topics/threat-intelligence/cl0p-mft-campaign-2025-update

OpenAI Shipmas — second edition

1 to 19 December. OpenAI repeats the daily stream-with-announcement format. The relevant ones for security:

  • GPT-5.5 / o4 family — update of the reasoning model with longer CoT and improvements on agentic benchmarks. No structural changes in the safety API; reasoning summary remains optional for the customer.
  • ChatGPT Agents 2.0 — the conversational agent with its own browser (Operator) merges with ChatGPT as a native mode. For enterprise deployments, it arrives with unified audit log and configurable action approval gates. The tool poisoning surface we saw in MCP generalises to any integration registered in the agent.
  • Sora 2 GA — video generation with character consistency. C2PA watermarking is maintained; bypass via re-encoding remains trivial. AESIA and other authorities start publishing guidelines on Art. 50 of the AI Act obligations (transparency for AI-generated content).
  • API security features — Anthropic-style structured outputs with JSON schema validation, scoped API keys with granular per-endpoint permissions. Catch-up with what Anthropic already had. For 2026 the platform security feature war between the three big vendors will be a competitive axis.

OpenAI’s framing for the year has been “agents to production”. The interesting part isn’t what they add, it’s what they don’t add: no Shipmas announcement on training data transparency or on post-deployment capability verification. The regulatory promise remains the minimum.

Source: https://openai.com/shipmas-2025/

Anthropic — Q4 Claude refresh + Responsible Scaling Policy v3

December 2025. Anthropic closes the year with:

  • Claude Opus 4.5 / Sonnet 4.5 — refresh with extended context and improvements on agentic tasks. Extended thinking remains as default; reasoning_content remains accessible to API customers.
  • Responsible Scaling Policy v3 — update of the framework with stricter metrics for ASL-4. The pre-deployment evaluation report publishes in full, including the results of the agentic misalignment harness applied to Opus 4.5 — metrics comparable to the May research. The trend: with each capability jump, the misalignment rates in controlled harness don’t drop; deployment defences have to compensate.

Anthropic signs the final version of the AI Office’s Code of Practice for GPAI in December after months of public negotiation. The difference with OpenAI / Google (which signed in July-August) and with Meta (which doesn’t sign): Anthropic publishes specific documentation on training data provenance and copyright opt-out mechanisms with more detail than the other signatories.

Source: https://www.anthropic.com/news/claude-4-5-opus · https://www.anthropic.com/policy/responsible-scaling-policy

DeepSeek — V4 and the open-weights consolidation

December 2025. DeepSeek publishes DeepSeek-V4 as open weights. MoE with architecture iterated over V3 and R1, 1M token context, native multimodal. Available on Hugging Face under MIT licence. Capabilities comparable to Claude 4.5 Opus and GPT-5.5 on public benchmarks, trained with compute reported in the 5M GPU-hours range.

What this closes from the year: the open-weights pattern with capability close to frontier is no longer exceptional. DeepSeek-R1 opened the category in January; QwQ, Llama 4 and now V4 consolidate it. For AI security, the scenario where any adversarial attack required a complicit model has ended. Any technique needing gradient access can be tried on a representative model at accessible costs.

For 2026: the operational question becomes what proportion of enterprise deployment uses open weights vs commercial API. Both options have their threat model. The difference is that open-weights leaves real weight auditing possible — something the Llama 4 / LMArena case made obligatory.

Source: https://github.com/deepseek-ai/DeepSeek-V4

LangChain LangGrinch — CVE-2025-68664, serialization injection

LangChain LangGrinch — CVE-2025-68664, serialization injection

December 2025. Cyata publishes analysis of CVE-2025-68664 in LangChain Core (Python) — christened LangGrinch. CVSS 9.3. The dumps() and dumpd() functions don’t escape dictionaries with the 'lc' key when serializing free metadata. The lc key is LangChain’s internal marker for its serialized objects; when the attacker manages to get a structure with 'lc' into user-controlled data, on re-serializing the model treats it as a legitimate LangChain object instead of plain data.

Exploitation vector: the attacker sends a prompt whose LLM response contains in additional_kwargs or response_metadata a structure {'lc': 1, 'type': 'constructor', 'id': [...], 'kwargs': {...}}. When the client serializes the conversation to persist or stream-respond, the round-trip loads arbitrary objects from the pre-approved langchain_core / langchain / langchain_community namespaces. With secrets_from_env=True active (default), it exfiltrates environment variables. With Jinja2 templates enabled, it executes arbitrary code.

LangChain.js has the equivalent bug as CVE-2025-68665 (CVSS 8.6). The patch introduces an allowed_objects allowlist in load()/loads(), disables Jinja2 by default and sets secrets_from_env=False as default. The minimum defensive action: pin versions, audit dumps() calls in pipelines that persist conversation, and review any integration using prompt-injectable fields as input to loads().

Source: https://github.com/advisories/GHSA-c67j-w6g6-q2cm · https://cyata.ai/blog/langgrinch-langchain-core-cve-2025-68664/ · https://thehackernews.com/2025/12/critical-langchain-core-vulnerability.html

Year-end reports — Mandiant, CrowdStrike, Verizon

December 2025 / January 2026. Annual reports confirming what the month-by-month already said:

  • Mandiant M-Trends 2026 — average global dwell time drops to 8 days (was 10 days in 2024, 16 in 2023). The drop is largely due to ransomware with immediate encryption: when the attacker encrypts in 48 hours, detection time shortens by necessity. It’s not a good signal; it’s a signal of faster attacker operation. Most common initial vector remains vulnerability exploitation (35%), above phishing (22%).
  • CrowdStrike Global Threat Report 2026 — Scattered Spider / Octo Tempest maintains sustained activity against retail and financial services. The DragonForce affiliation seen at M&S is confirmed as a stable pattern. Identity-based attacks (credential abuse, MFA bypass, session hijacking) make up 45% of documented intrusions.
  • Verizon DBIR 2025 (published in May, echo here) — confirms that vulnerabilities as initial vector surpass credentials for the first time in investigated breaches. The curve came from 2024 and is confirmed.

Joint reading: the operational attacker goes faster and specialises more. The industrialisation of sector playbooks seen in UK retail and in crypto exchange during 2025 is the trend, not the exception.

Sources: https://www.mandiant.com/m-trends · https://www.crowdstrike.com/global-threat-report/ · https://www.verizon.com/business/resources/reports/dbir/

DORA / NIS2 — first inspection cycle

December 2025. Close of the first year of DORA application. Banco de España, CNMV and DGSFP start inspections of systemic entities during Q3-Q4. First public observations:

  • Obligation → technical control mapping remains blurry in most entities. Documentation exists; traceability between the article and the concrete control, doesn’t.
  • TLPT — first Threat-Led Penetration Testing cycle scheduled for 2026-2028. The final RTS was published in July.
  • Critical ICT TPPs register — ESAs publish during the year the designation methodology. Official list of providers designated as critical remains pending at December close.

NIS2 in Spain is finally transposed via Organic Law X/2025 (published in BOE during the second half, exact date pending verification at the time of writing). For 2026 the obligations for incident reporting to INCIBE-CERT and the INCIBE sanction regime move to effective application. The operational coordination DORA ↔ NIS2 ↔ ENS for Spanish financial entities is first-quarter 2026 work.

EU AI Act — the second GPAI step consolidates during the second half. AESIA starts the first exploratory market surveillance on Annex III high-risk products. The 2 August 2026 date (high-risk obligations) is already on the operational calendar of every EU provider.

Sources: https://eur-lex.europa.eu/eli/reg/2022/2554/oj · https://www.boe.es/buscar/act.php?id=LO-X-2025

Retrospective — the year in one table

Without claiming exhaustiveness, the year’s milestones by month:

MonthAI securityCyberCompliance
JanuaryDeepSeek-R1 open-weights reasoningSonicWall SMA1000 CVE-2025-23006 zero-dayDORA in application
FebruaryOpenAI Operator GA · Claude 3.7 SonnetByBit hack $1.5B via Safe{Wallet} frontendAI Act Art. 5 in application
MarchMCP TPA — first public PoC (Invariant Labs)Patch Tuesday Hyper-V CVEs
AprilLlama 4 + LMArena controversyM&S / Co-op / Harrods — UK retail wave
MayClaude 4 + agentic misalignment researchCoinbase insider compromise · Verizon DBIR 2025NIS2 Spain draft bill advances
JuneAnthropic Project Vend · AWS re:InforceENISA Threat Landscape draft · UK retail postmortems
JulyReasoning model jailbreaks H1 retrospectiveSharePoint ToolShell CVE-2025-53770GPAI Code of Practice published
AugustDARPA AIxCC final DEF CON 33 · Black Hat 33Patch Tuesday August · various zero-daysAI Act GPAI in application
SeptemberiOS 19 release CVEsApple iPhone 17 + CVE batch · ENISA Threat Landscape edition
OctoberAnthropic Claude 4.5 preview · OpenAI dev dayWindows 10 end of support 14 OctNIS2 Spain transposed (LO X/2025)
Novemberre:Invent + Ignite AI security announcementsKEV adds · Salt Typhoon updates
DecemberDeepSeek-V4 · Claude 4.5 Opus · OpenAI Shipmas 2Cl0p MFT first-anniversary activityDORA first inspection cycle

The year’s technical posts are linked above; the separate retrospectives — AI security and cyber — enter the detail.

Cross-cutting pattern — 2025 in one sentence

If I have to distil 2025 into one sentence: the regulatory calendar becomes operational and the attacker industrialises sector playbooks faster than the structural defender. DORA and AI Act stop being a promise. Lazarus has an industrial playbook against exchanges; Scattered Spider against UK retail; China-nexus against on-prem SharePoint. Each playbook is reusable against the next targets in the same sector. Industrialisation goes faster than defensive coordination.

The operational plan for 2026 coming out of 2025:

  1. Explicit regulatory inventory — DORA scope, NIS2 scope, EU AI Act scope for each product and process. Without inventory, the first inspection cycle arrives and is done ad hoc.
  2. Audit of financial and approval visualisation chains — UI served by third parties, helpdesk with privileged resets, SaaS secrets vaults. Three categories that 2025 proved fragile.
  3. Patch hygiene on legacy on-prem with the assumption of incomplete patch. SharePoint, Exchange, enterprise MFT, Citrix appliances. Post-incident secrets rotation as default playbook.
  4. Threat modelling for AI features with tools before shipping. The Recall lesson remains valid and is extended with the Project Vend lesson: agents in real production behave differently from the demo.
  5. Explicit inventory of post-EoL Windows 10. ESU paid / migrated / accepting risk / unknown. The fourth category is where 2026-2027 incidents will appear.

The year closes. The first post of 2026 enters already with AESIA inspecting GPAI providers, with CRA starting to apply in blocks, with DORA finishing its first inspection cycle, and with the AI Act 2 August 2026 calendar (high-risk) six months away.

Back to Blog

Related Posts

View All Posts »
Bulletin — December 2024

news · 12 min

Bulletin — December 2024

Cl0p reoffends with Cleo MFT and breaks the first patch. BeyondTrust SaaS leads to Treasury (Silk Typhoon, not Volt). 12 days of OpenAI Shipmas. Gemini 2.0, Phi-4, DeepSeek-V3. DORA starts on 17 January. 2024 retrospective.

· Manuel López Pérez

Bulletin — May 2026

news · 13 min

Bulletin — May 2026

The Digital Omnibus reaches a provisional deal on 7 May: Annex III moves to December 2027. Spain approves its AI governance bill on 26 May. Pwn2Own Berlin pays out $1.3M for 47 zero-days, with Codex and Claude Code on the menu. Patch Tuesday ships with no zero-days for the first time since June 2024. OpenAI launches Daybreak and Anthropic moves Mythos toward GA. Verizon DBIR 2026 crowns vulnerability exploitation as the number-one vector. GitHub loses 3,800 internal repos to a poisoned VS Code extension.

· Manuel López Pérez

Bulletin — April 2026

news · 13 min

Bulletin — April 2026

The Omnibus trilogue closes without agreement on 28 April, leaving the original AI Act deadline three months away. Patch Tuesday with 165 CVEs and an active SharePoint zero-day. Anthropic announces Claude Mythos + Project Glasswing — the first frontier model held behind a defensive wall. Pwn2Own Berlin collapses under oversubscription. M&S one year on. AESIA publishes guides 13 and 14.

· Manuel López Pérez