> For the complete documentation index, see [llms.txt](https://docs.probe.splx.ai/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://docs.probe.splx.ai/updates/product-updates/august-2025.md).

# August 2025

## New Layout

<figure><img src="/files/gZZkEi9aPm7FdDG1FEFi" alt=""><figcaption></figcaption></figure>

The updated side navigation introduces four main modules, each designed to group related capabilities for easier access. **Available modules depend on your subscription plan**:

1. **AI Red Teaming**
   * **Probe** - Tools for automated red teaming and uncovering vulnerabilities.
   * **Remediation** - Prompt Hardening to strengthen model system prompts against misuse.
   * **Monitoring** - Log Analysis to review and analyze interaction logs to identify risks and improvement areas.
2. **AI Runtime Protection**
   * UI supporting our runtime protection. Here, you can configure guardrail policies, visualize and interpret results, and use the integrated playground to test and refine configurations.
3. **AI Benchmarks**
   * Standardized evaluation of AI models against SPLX probes.
4. **Settings**
   * Manage user accounts, workspaces, permissions, and organizational configurations.

## Public APIs

* [**Probe Run**](/platform-api/api-reference/probe-run.md)**:** Remediation tasks (both static and dynamic, generated from AI analysis) are now included in the probe run endpoint response.
* [**Target**](/platform-api/api-reference/target.md)**:**
  * New **Scores endpoint** returns the overall performance score, category scores, and totals for simulated and successful attacks.
  * New **Target Test Run export endpoint** provides details of all test runs on a single target.
* [**Benchmarks**](/platform-api/api-reference/benchmarks.md)**:** Multiple new endpoints added. All information from the platform’s benchmark feature can now be exported via API, from high-level model details down to specific test cases contributing to scores.

## Models Benchmarked

Our [Model Benchmarks](/ai-benchmarks/model-benchmarks.md) have been extended to include results for the following models:

* Anthropic Claude 3.7 Sonnet
* Anthropic Claude Opus 4.1
* Deepseek-chat-v3.1
* Gemma 3 27 It
* Gemma 3n E4B It
* Gemma-3-12b-it
* Gemma-3-4b-it
* Gemini 2.5 Flash
* Gemini 2.5 Pro
* Llama 3 70B Instruct
* Llama 3 8B Instruct
* Llama 3.1 70B Instruct
* Llama 3.1 8B Instruct
* Llama 3.2 1B Instruct
* Llama 3.2 3B Instruct
* Llama-3.3-70B-Instruct
* Llama-3.3-8B-Instruct
* Mistral Large 2
* Mistral Medium 3
* Mistral Small 3.1
* OpenAI gpt-oss-120b
* OpenAI gpt-oss-20b
* OpenAI GPT 5
* OpenAI GPT 5-chat
* OpenAI o3
* OpenAI o4-mini

## Improvements & Tweaks

### Custom Compliance

* Add item action is now disabled if a new item section is already open.
* Improved naming conventions for better consistency with predefined compliances.
* The same probe can no longer be selected multiple times within a single compliance item.
* Increased contrast on separation lines for clearer distinction between sections.

### Test & Probe Run Improvements

* Test run table now also displays results for **cancelled test runs**, improving consistency, visibility, and traceability.

### Targets & Connections

* [**Azure OpenAI**](/ai-red-teaming/probe/target/index/azure-openai.md): Added input fields for **extra LLM parameters** and **API version.**


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.probe.splx.ai/updates/product-updates/august-2025.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
