_symlink entries in plugin tarballs are skipped (security posture, correct) but now emit a logger.warning so operators can audit what was dropped: "skipping symlink in plugin tarball (not supported for security): <name> -> <target>" Added test_safe_extract_logs_warning_for_skipped_symlink asserting the warning is present in caplog records at WARNING level. All 211 tests pass (+1 new). known-issues.md updated.
254 lines
9.5 KiB
Markdown
254 lines
9.5 KiB
Markdown
# Known Issues — molecule-sdk-python
|
|
|
|
Issues identified in source but not yet filed as GitHub issues (GH_TOKEN
|
|
unavailable in automated agent contexts). Each entry has: location,
|
|
symptom, impact, suggested fix.
|
|
|
|
Format per entry:
|
|
```
|
|
## KI-N — Short title
|
|
|
|
**File:** `<path>:<line>`
|
|
**Status:** TODO comment / identified / partially fixed
|
|
**Severity:** Critical / High / Medium / Low
|
|
**Platform phase:** (optional — which Phase 30 sub-phase is affected)
|
|
|
|
### Symptom
|
|
...
|
|
|
|
### Impact
|
|
...
|
|
|
|
### Suggested fix
|
|
...
|
|
---
|
|
```
|
|
|
|
---
|
|
|
|
## KI-001 — RemoteAgentClient does not implement inbound A2A server
|
|
|
|
**File:** `molecule_agent/client.py`
|
|
**Status:** Known limitation; not yet implemented
|
|
**Severity:** Medium
|
|
**Platform phase:** Phase 30.8b
|
|
|
|
### Symptom
|
|
`RemoteAgentClient` can call other workspaces via A2A (outbound), but cannot
|
|
receive inbound A2A calls. Any workspace that tries to delegate to or message
|
|
this agent will get a connection refused or timeout.
|
|
|
|
### Impact
|
|
Agents running outside the platform's Docker network via `molecule_agent` are
|
|
one-directional. Platform agents cannot push work to them — the remote agent
|
|
must poll or be provisioned with a publicly reachable webhook endpoint.
|
|
|
|
### Suggested fix
|
|
Add an `A2AServerMixin` class that exposes a `FastAPI` or `flask` route
|
|
(`POST /a2a/inbound`) and runs in a background thread alongside the client's
|
|
heartbeat loop. Register the inbound URL with the platform via the
|
|
`/registry/discover` update endpoint when the server starts. See Phase 30.8b
|
|
in the platform `PLAN.md`.
|
|
|
|
---
|
|
|
|
## KI-002 — Delegation has no server-side idempotency key enforcement
|
|
|
|
**File:** `molecule_agent/client.py` (client-side SHA256 key)
|
|
**Status:** Partially mitigated client-side (SHA256 rounded-to-minute)
|
|
**Severity:** Medium
|
|
**Platform phase:** Phase 30.6
|
|
|
|
### Symptom
|
|
The client generates an idempotency key as `SHA256(task + current_minute)`, but
|
|
the platform's `POST /workspaces/:id/delegate` endpoint does not enforce
|
|
idempotency server-side. Two identical tasks sent within the same calendar
|
|
minute produce duplicate processing if the platform accepts both.
|
|
|
|
### Impact
|
|
A workspace container restart mid-delegation (e.g. liveness probe restart) that
|
|
fires the same delegation request twice will result in duplicate side-effects
|
|
(double commits, double API calls, double messages) if the platform has not yet
|
|
stored the first delegation's result.
|
|
|
|
### Suggested fix
|
|
Platform-side: accept an optional `idempotency_key` field in
|
|
`POST /workspaces/:id/delegate`, check for existing non-failed delegation with
|
|
the same `(workspace_id, idempotency_key)`, return HTTP 200 with existing ID
|
|
instead of creating a new row. Client-side key generation is correct; it is
|
|
the server that needs to honor it.
|
|
|
|
---
|
|
|
|
## KI-003 — `_safe_extract_tar` silently skips all symlinks
|
|
|
|
**File:** `molecule_agent/client.py:_safe_extract_tar`
|
|
**Status:** ✅ Resolved
|
|
**Severity:** Low (misleading behavior)
|
|
|
|
### Resolution
|
|
`_safe_extract_tar` now emits a `logger.warning` for every skipped symlink:
|
|
|
|
```
|
|
skipping symlink in plugin tarball (not supported for security): <name> -> <target>
|
|
```
|
|
|
|
The file is still skipped (symlinks are a security risk in untrusted tarballs).
|
|
The warning lets operators audit what was dropped without changing the security
|
|
posture.
|
|
|
|
Added `test_safe_extract_logs_warning_for_skipped_symlink` in
|
|
`tests/test_remote_agent.py` asserting the warning is emitted.
|
|
|
|
### Suggested fix
|
|
Emit a `logger.warning()` for each skipped symlink so operators can see what
|
|
was dropped. Alternatively, allow safe relative symlinks (those resolving
|
|
within the extraction root) while blocking absolute symlinks and `..`-escaping
|
|
symlinks. Document the behavior in the plugin authoring guide.
|
|
|
|
---
|
|
|
|
## KI-004 — Token file races between concurrent instances of RemoteAgentClient
|
|
|
|
**File:** `molecule_agent/client.py` (token caching)
|
|
**Status:** ✅ Resolved
|
|
**Severity:** Low
|
|
|
|
### Resolution
|
|
Added `fcntl.flock` around token read/write operations in `load_token()` and
|
|
`save_token()`:
|
|
|
|
- `load_token()` — acquires a shared lock (`LOCK_SH | LOCK_NB`) before reading.
|
|
Returns `None` immediately if the lock is contended rather than blocking.
|
|
- `save_token()` — acquires an exclusive lock (`LOCK_EX | LOCK_NB`) before
|
|
writing. If the lock is held by another writer, logs a warning and skips the
|
|
write (the in-memory `_token` is still updated so this instance functions
|
|
correctly). Releases the lock in a `finally` block.
|
|
|
|
Concurrent readers are always safe (shared lock allows multiple simultaneous
|
|
readers). Concurrent writers are serialised by the exclusive lock; if a writer
|
|
cannot acquire the lock immediately it gracefully degrades rather than blocking.
|
|
The platform's one-token-per-workspace invariant is preserved — no stale token
|
|
overwrites.
|
|
|
|
---
|
|
|
|
## KI-005 — `validate_manifest` does not check for secrets in bundle manifests
|
|
|
|
**File:** `molecule_plugin/manifest.py:validate_manifest`
|
|
**Status:** ✅ Fixed — `_scan_for_secrets()` added; called from `validate_manifest`
|
|
**Resolved in:** `fix/ki-005-ki-007` branch
|
|
**Severity:** High
|
|
|
|
### Symptom
|
|
`validate_manifest` does not scan the `env:` or `secrets:` fields of a
|
|
`plugin.yaml` for hardcoded credentials (API keys, passwords, tokens). Plugin
|
|
authors could accidentally commit secrets into what should be a generic bundle.
|
|
|
|
### Impact
|
|
Secrets committed to a plugin manifest are visible in the repo and any tarball
|
|
published to PyPI or the plugin registry. Per platform constraints
|
|
(`constraints-and-rules.md`), bundles must never contain secrets.
|
|
|
|
### Suggested fix
|
|
Add a `validate_no_secrets()` check in `validate_manifest` that scans all
|
|
string values in the manifest for patterns matching common secret formats
|
|
(`sk-`, `ghp_`, ` Bearer `, 32+ char hex strings, etc.). Return a
|
|
`ValidationError` with level `HIGH` if any are found, even in example or
|
|
placeholder values. Add a corresponding test with a manifest containing a
|
|
known secret pattern.
|
|
|
|
---
|
|
|
|
## KI-006 — Plugin content integrity not verified client-side (RESOLVED)
|
|
|
|
**File:** `molecule_agent/client.py:verify_plugin_sha256`, `molecule_plugin/manifest.py:validate_manifest`
|
|
**Status:** ✅ Implemented — see SDK PR on `docs/add-claude-md` branch
|
|
**Severity:** Medium (mitigated by platform-side pinned-ref enforcement from molecule-core PR #1019)
|
|
|
|
### Symptom
|
|
`install_plugin()` downloaded and extracted plugin tarballs with no client-side
|
|
content verification. A compromised platform registry serving a tampered tarball
|
|
under a valid pinned-ref would pass `_safe_extract_tar` (no `..` or absolute
|
|
paths) but could contain a malicious `setup.sh`.
|
|
|
|
### Resolution
|
|
Added:
|
|
- `verify_plugin_sha256(plugin_dir, expected)` — computes a content-addressed
|
|
manifest hash over sorted `(relative_path, SHA256(content))` pairs; deterministic
|
|
regardless of extraction order or timestamps.
|
|
- `install_plugin()` reads `plugin.yaml → sha256` after atomic rename and before
|
|
`setup.sh`; mismatches raise `ValueError` and delete the plugin directory.
|
|
- `PLUGIN_YAML_SCHEMA` gains an optional `sha256` field (64-char lowercase hex).
|
|
- `validate_manifest()` validates `sha256` format when present.
|
|
|
|
Platform-side (molecule-core PR #1019) enforces source integrity (pinned git SHAs
|
|
or semver tags). SDK-side closes the content-integrity gap. Together they cover
|
|
both the "which code was fetched" and "did it arrive intact" axes.
|
|
|
|
Authors should add `sha256` to their `plugin.yaml` (generate with
|
|
`python -m molecule_agent verify-sha256 <plugin-dir>`) and commit it alongside
|
|
the plugin content.
|
|
|
|
---
|
|
|
|
## KI-007 — `_is_hex` raises `TypeError` on non-string arguments instead of returning `False`
|
|
|
|
**File:** `molecule_agent/client.py:_is_hex`
|
|
**Status:** ✅ Fixed — isinstance guard added
|
|
**Resolved in:** `fix/ki-005-ki-007` branch
|
|
**Severity:** Low
|
|
|
|
### Symptom
|
|
`_is_hex` is called inside `verify_plugin_sha256` after a length check. When
|
|
passed a non-string argument (e.g. `None`, an `int`, a `list`), `int(value, 16)`
|
|
raises `TypeError: int() can't convert non-string with explicit base` instead of
|
|
returning `False`. `verify_plugin_sha256` would surface a confusing `TypeError`
|
|
rather than a descriptive validation error.
|
|
|
|
### Impact
|
|
Any bug passing a non-string `expected` to `verify_plugin_sha256` produces a
|
|
confusing `TypeError` instead of the intended `ValueError`. Low-probability
|
|
edge case (function is internal), but violates the principle that validator
|
|
functions should never raise unexpected exceptions.
|
|
|
|
### Suggested fix
|
|
Guard at the top of `_is_hex`:
|
|
```python
|
|
def _is_hex(value: str) -> bool:
|
|
if not isinstance(value, str):
|
|
return False
|
|
try:
|
|
int(value, 16)
|
|
return True
|
|
except ValueError:
|
|
return False
|
|
```
|
|
|
|
---
|
|
|
|
## KI-008 — `test_call_peer_errors.py` fails collection due to missing `tests/conftest.py`
|
|
|
|
**File:** `tests/test_call_peer_errors.py:19`
|
|
**Status:** Identified
|
|
**Severity:** Low
|
|
|
|
### Symptom
|
|
`pytest tests/` fails to collect any tests:
|
|
```
|
|
ModuleNotFoundError: No module named 'tests.conftest'
|
|
```
|
|
The file imports `from tests.conftest import _CaptureHandler` using the
|
|
`tests.` package prefix. The cloned repo has no `conftest.py` and uses a
|
|
convention inconsistent with the rest of the test suite (which uses root-relative
|
|
imports).
|
|
|
|
### Impact
|
|
CI running `pytest tests/` errors before collecting any tests at all. Requires
|
|
`--ignore=tests/test_call_peer_errors.py` to run the full suite.
|
|
|
|
### Suggested fix
|
|
Either create `tests/conftest.py` with the `_CaptureHandler` stub definition,
|
|
or change the import to use a direct module import (`from conftest import
|
|
_CaptureHandler`) consistent with the rest of the suite.
|