feat: add bilateral receipt signing (pre-execution + post-execution) by arian-gogani · Pull Request #1333 · microsoft/agent-governance-toolkit

arian-gogani · 2026-04-22T17:41:21Z

Extends receipts.py with bilateral receipt support per the design doc in #1302.

Three new functions:

sign_authorization() — signs pre-execution commitment (proves policy evaluated before action ran)
seal_result() — seals post-execution outcome (binds actual result to the authorization)
verify_bilateral_receipt() — verifies both signatures, falls back to verify_receipt() for standard envelopes

Backward compatible. Existing single-signature receipts verify unchanged. Bilateral fields are additive.

The result signature covers the binding of authorization_hash and result_hash together, proving:

The authorization existed before execution
The result was produced after execution
Both were signed by the same key

11 tests pass locally covering: creation, chain linkage, sealing, tamper detection (payload and result), wrong key rejection, and deny receipts.

Relates to #1249.

github-actions · 2026-04-22T17:42:22Z

🤖 AI Agent: security-scanner — Security Review of `feat: add bilateral receipt signing (pre-execution + post-execution)`

Security Review of `feat: add bilateral receipt signing (pre-execution + post-execution)`

This PR introduces bilateral receipt signing for compliance verification, which is a critical feature for ensuring the integrity and verifiability of agent actions. Below is the security analysis of the changes:

1. Prompt Injection Defense Bypass

Assessment: 🔵 LOW
The changes in this PR do not directly handle user input or natural language processing, so prompt injection is not applicable here. The focus is on cryptographic signing and verification of receipts.

2. Policy Engine Circumvention

Assessment: 🟡 MEDIUM
Issue: The sign_authorization() function relies on the payload argument to generate the authorization_hash. If the payload is not properly validated or sanitized before signing, an attacker could potentially inject malicious data into the payload, which would then be signed and treated as valid. This could allow circumvention of policy enforcement if the payload is manipulated to misrepresent the policy or action details.
Fix: Add strict validation for the payload to ensure it adheres to the expected schema and does not contain unexpected or malicious fields. Consider using a JSON schema validator to enforce the structure and content of the payload.

3. Trust Chain Weaknesses

Assessment: 🟠 HIGH
Issue: The verify_bilateral_receipt() function does not validate the signerKeyId field in the signature section of the envelope. This could allow an attacker to use a compromised or unauthorized key to sign receipts, bypassing the trust chain.
Fix: Add a check to ensure that the signerKeyId in the receipt matches the expected key ID for the agent. This can be done by maintaining a trusted key registry or using a secure key management system to verify the key's authenticity.

4. Credential Exposure

Assessment: 🔵 LOW
No sensitive credentials or secrets are logged or exposed in the changes. The private key used for signing is passed as a Signer object, which is assumed to be securely managed outside the scope of this PR.

5. Sandbox Escape

Assessment: 🔵 LOW
The changes do not introduce any new code that interacts with the operating system or external processes, so sandbox escape is not a concern here.

6. Deserialization Attacks

Assessment: 🟡 MEDIUM
Issue: The _canonicalize() function is used to serialize the payload and result data for hashing and signing. If _canonicalize() relies on a vulnerable or improperly configured JSON library, it could be exploited for deserialization attacks.
Fix: Ensure that _canonicalize() uses a secure JSON serialization library that does not allow arbitrary code execution or unsafe object creation. If _canonicalize() is a custom implementation, review it for potential vulnerabilities.

7. Race Conditions

Assessment: 🟡 MEDIUM
Issue: The seal_result() function assumes that the authorization_hash in the envelope has not been tampered with between the sign_authorization() and seal_result() calls. If the envelope is modified in a concurrent environment, it could lead to a mismatch between the authorization and result signatures, potentially invalidating the receipt.
Fix: Add a mechanism to ensure the integrity of the envelope between the sign_authorization() and seal_result() steps. This could involve locking the envelope or using a secure storage mechanism to prevent unauthorized modifications.

8. Supply Chain Risks

Assessment: 🟡 MEDIUM
Issue: The PR does not explicitly mention any new dependencies, but it uses cryptographic functions (e.g., Ed25519PublicKey, hashlib, _b64url, _canonicalize). If these are third-party libraries or custom implementations, they should be reviewed for security vulnerabilities.
Fix: Ensure that all cryptographic libraries are up-to-date and have no known vulnerabilities. If _b64url and _canonicalize are custom functions, review their implementations for security issues.

Additional Observations:

Error Handling: The verify_bilateral_receipt() function does not log or handle exceptions in a way that provides meaningful feedback for debugging. This could make it harder to identify and fix issues during verification.
Recommendation: Add detailed error messages and logging for failed signature verifications to aid in debugging without exposing sensitive information.
Replay Attacks: There is no mention of mechanisms to prevent replay attacks, where a valid receipt could be reused maliciously.
Recommendation: Consider adding a nonce or timestamp validation to ensure that each receipt is unique and cannot be reused.
Documentation: The proposal document is well-written and provides a clear explanation of the feature. However, it would be helpful to include explicit security considerations and threat models to guide future development and audits.

Summary of Findings:

Category	Severity	Issue	Fix
Prompt Injection Defense Bypass	🔵 LOW	Not applicable.	None
Policy Engine Circumvention	🟡 MEDIUM	Lack of strict validation for `payload`.	Add schema validation for `payload`.
Trust Chain Weaknesses	🟠 HIGH	`signerKeyId` is not validated.	Validate `signerKeyId` against a trusted key registry or KMS.
Credential Exposure	🔵 LOW	No credentials exposed.	None
Sandbox Escape	🔵 LOW	Not applicable.	None
Deserialization Attacks	🟡 MEDIUM	Potential risks in `_canonicalize()` implementation.	Ensure secure JSON serialization and review `_canonicalize()` for vulnerabilities.
Race Conditions	🟡 MEDIUM	Potential tampering of `authorization_hash` between signing and sealing.	Use locking or secure storage to protect the envelope between steps.
Supply Chain Risks	🟡 MEDIUM	Cryptographic libraries and custom functions not explicitly reviewed.	Ensure all libraries are up-to-date and review custom functions for security vulnerabilities.
Replay Attacks	🟡 MEDIUM	No mechanism to prevent replay attacks.	Add nonce or timestamp validation to ensure receipt uniqueness.

Final Recommendation:

The PR introduces a critical feature for compliance and verifiability, but there are several medium and high-severity issues that need to be addressed before merging. Specifically:

Add schema validation for payload in sign_authorization().
Validate signerKeyId against a trusted key registry or KMS.
Review _canonicalize() and _b64url for security vulnerabilities.
Implement mechanisms to prevent replay attacks.
Add error handling and logging for failed verifications.

Once these issues are resolved, the PR can be considered for merging.

github-actions

🤖 AI Agent: code-reviewer

Review Summary

This PR introduces a significant enhancement to the receipts.py module by adding support for bilateral receipt signing, which includes pre-execution authorization (sign_authorization) and post-execution result sealing (seal_result). Additionally, it provides a verification mechanism (verify_bilateral_receipt) to validate both the authorization and result signatures. The implementation appears to be backward compatible, as it does not modify the existing single-signature receipt verification logic.

The changes are well-documented, with a detailed proposal included in the PR. The new functionality is critical for ensuring compliance with regulations like the EU AI Act and SOC 2, as it provides independently verifiable compliance evidence.

🔴 CRITICAL

Replay Attack Vulnerability in Authorization Signature:
- The sign_authorization function does not include a unique nonce or identifier in the payload before signing. This could allow an attacker to reuse a valid authorization signature in a different context, leading to a potential security bypass.
- Recommendation: Include a unique, cryptographically secure nonce in the payload before signing. This ensures that each authorization signature is unique and cannot be reused in a replay attack.
Lack of Expiry for Authorization Receipts:
- The sign_authorization function does not enforce an expiration time for the authorization. This could allow an attacker to use an old authorization receipt to execute actions long after it was issued.
- Recommendation: Add an expires_at field to the authorization payload and validate it during the verify_bilateral_receipt process.
Insufficient Validation of Input Data:
- The sign_authorization, seal_result, and verify_bilateral_receipt functions do not validate the structure or types of the input data (e.g., payload, result_data, envelope). This could lead to unexpected behavior or security vulnerabilities if malformed data is passed.
- Recommendation: Use Pydantic models to validate the input data for these functions. This will ensure type safety and prevent potential issues from malformed inputs.
Canonicalization Process Not Validated:
- The _canonicalize function is used to generate canonical JSON for signing, but there is no validation to ensure that the canonicalization process is consistent and secure.
- Recommendation: Add tests to verify that the _canonicalize function produces consistent and correct output for various edge cases, including nested JSON structures, special characters, and large payloads.

🟡 WARNING

Potential Breaking Change in verify_bilateral_receipt:
- The verify_bilateral_receipt function falls back to verify_receipt() for non-bilateral envelopes. While this ensures backward compatibility, any changes to the verify_receipt() function in the future could inadvertently affect the behavior of verify_bilateral_receipt.
- Recommendation: Clearly document this dependency in the code and consider adding tests to ensure that changes to verify_receipt() do not break verify_bilateral_receipt.
Backward Compatibility Assumptions:
- The PR assumes that the addition of new fields (e.g., bilateral, result) to the receipt structure will not break existing consumers of the receipts.py module. While this is likely true, it should be explicitly tested.
- Recommendation: Add tests to ensure that existing functionality (e.g., single-signature receipt verification) continues to work as expected with the new fields present.

💡 SUGGESTIONS

Improve Error Messages:
- The error messages in seal_result (e.g., "Cannot seal a non-bilateral receipt") could be more descriptive to aid debugging.
- Recommendation: Include additional context in error messages, such as the current state of the envelope or the expected state.
Thread Safety:
- The sign_authorization and seal_result functions rely on the Signer object, which includes a private_key. If the Signer object is shared across threads, this could lead to race conditions or other concurrency issues.
- Recommendation: Document whether the Signer object is thread-safe. If it is not, consider adding locks or requiring that each thread use its own Signer instance.
Extend Test Coverage:
- While the PR mentions that 11 tests pass locally, it is unclear whether edge cases (e.g., malformed envelopes, invalid signatures, missing fields) are thoroughly tested.
- Recommendation: Add tests for edge cases, including:
  - Missing or malformed authorization and result sections.
  - Invalid or tampered signatures.
  - Receipts with unexpected or additional fields.
Performance Considerations:
- The verify_bilateral_receipt function performs multiple cryptographic operations (e.g., signature verification, hash computation). For high-throughput scenarios, this could become a bottleneck.
- Recommendation: Benchmark the performance of the verification process and consider optimizations if necessary. For example, caching intermediate results or parallelizing verification steps.
Documentation Improvements:
- The proposal document is comprehensive, but the code itself could benefit from more inline comments explaining the rationale behind key design decisions (e.g., why certain fields are included in the receipt).
- Recommendation: Add comments to the code to explain the purpose of each field in the receipt and the reasoning behind the design choices.

Summary of Actionable Feedback

🔴 CRITICAL

Add a cryptographically secure nonce to the payload in sign_authorization to prevent replay attacks.
Include an expires_at field in the authorization payload and validate it during verification.
Use Pydantic models to validate input data for sign_authorization, seal_result, and verify_bilateral_receipt.
Add tests to validate the _canonicalize function for edge cases.

🟡 WARNING

Document the dependency of verify_bilateral_receipt on verify_receipt() and add tests to ensure future changes do not break compatibility.
Add tests to confirm that existing single-signature receipt functionality remains unaffected by the new fields.

💡 SUGGESTION

Improve error messages in seal_result for better debugging.
Document or ensure thread safety of the Signer object.
Extend test coverage to include edge cases and malformed inputs.
Benchmark and optimize the performance of verify_bilateral_receipt if necessary.
Add inline comments to explain the rationale behind key design decisions in the code.

Let me know if you need further clarification or assistance!

arian-gogani · 2026-04-22T17:44:53Z

@microsoft-github-policy-service agree

@arian-gogani

…rifier Two harness updates from Apr 28-29 GitHub activity: - Nobulex (@arian-gogani) merged @nobulex/crypto into Microsoft Agent Governance Toolkit (microsoft/agent-governance-toolkit#1333, OpenSSF passing badge). Material elevation — bilateral-receipt primitive is now Microsoft-shipped, not vendor-isolated. - msaleme (Michael Saleme) volunteered to add their x402 conformance harness as 6th independent verifier (A2A #1672 comment 2026-04-29). Targeting v4.5 with claim_type-tagged output. The 41 existing x402 tests map onto continuity-layer evidence_basis.evidence_type. payment_execution lane. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

Creates ADOPTERS.md as referenced in the README. Nobulex's bilateral receipt primitive was merged into AGT via PRs microsoft#1302 and microsoft#1333 (216 LOC, 11 tests). 8 independent implementations have byte-match validated the JCS/RFC 8785 canonicalization substrate. AAIF Growth-stage proposal filed with TC sponsorship from the CTEF maintainer.

…icrosoft#1333) * Create verifiable-compliance-receipts.md * add bilateral receipt signing (pre-execution + post-execution)

arian-gogani added 2 commits April 21, 2026 16:43

Create verifiable-compliance-receipts.md

d809422

add bilateral receipt signing (pre-execution + post-execution)

28308a3

github-actions Bot added size/L Large PR (< 500 lines) documentation Improvements or additions to documentation labels Apr 22, 2026

github-actions Bot reviewed Apr 22, 2026

View reviewed changes

imran-siddique merged commit 1bb9a9b into microsoft:main Apr 22, 2026
10 of 11 checks passed

imran-siddique mentioned this pull request Apr 22, 2026

[Feature]: independently verifiable compliance evidence via hash-chained receipts #1249

Closed

1 task

aeoess mentioned this pull request Apr 23, 2026

Agent Passport System: cryptographic enforcement layer for ASI-03, ASI-07, ASI-08, ASI-10 OWASP/www-project-top-10-for-large-language-model-applications#812

Open

kenneives mentioned this pull request Apr 29, 2026

Proposal: Agent Identity Verification for Agent Cards a2aproject/A2A#1672

Open

kenneives mentioned this pull request Apr 29, 2026

Proposal: Cryptographic Agent Identity extension (CTEF v0.3.1-aligned) a2aproject/A2A#1786

Open

arian-gogani mentioned this pull request May 1, 2026

[Project Proposal] Nobulex aaif/project-proposals#20

Open

arian-gogani mentioned this pull request May 2, 2026

Add ADOPTERS.md with Nobulex as first adopter #1703

Open

27 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add bilateral receipt signing (pre-execution + post-execution)#1333

feat: add bilateral receipt signing (pre-execution + post-execution)#1333
imran-siddique merged 2 commits intomicrosoft:mainfrom
arian-gogani:feat/bilateral-receipts

arian-gogani commented Apr 22, 2026

Uh oh!

github-actions Bot commented Apr 22, 2026

Security Review of `feat: add bilateral receipt signing (pre-execution + post-execution)`

1. Prompt Injection Defense Bypass

2. Policy Engine Circumvention

3. Trust Chain Weaknesses

4. Credential Exposure

5. Sandbox Escape

6. Deserialization Attacks

7. Race Conditions

8. Supply Chain Risks

Additional Observations:

Summary of Findings:

Final Recommendation:

Uh oh!

github-actions Bot left a comment

Uh oh!

arian-gogani commented Apr 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

arian-gogani commented Apr 22, 2026

Uh oh!

github-actions Bot commented Apr 22, 2026

Security Review of feat: add bilateral receipt signing (pre-execution + post-execution)

1. Prompt Injection Defense Bypass

2. Policy Engine Circumvention

3. Trust Chain Weaknesses

4. Credential Exposure

5. Sandbox Escape

6. Deserialization Attacks

7. Race Conditions

8. Supply Chain Risks

Additional Observations:

Summary of Findings:

Final Recommendation:

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

🤖 AI Agent: code-reviewer

Review Summary

🔴 CRITICAL

🟡 WARNING

💡 SUGGESTIONS

Summary of Actionable Feedback

🔴 CRITICAL

🟡 WARNING

💡 SUGGESTION

Uh oh!

arian-gogani commented Apr 22, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Security Review of `feat: add bilateral receipt signing (pre-execution + post-execution)`