Files

yv01p 9962df50ee feat(core): register IContentCrudService in DI container

Adds IContentCrudService registration to UmbracoBuilder alongside
IContentService. Both services are now resolvable from DI.

Includes integration test verifying successful resolution.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-12-21 03:31:23 +00:00

17 KiB

Raw Blame History

Critical Implementation Review #6: Phase 1 ContentService CRUD Extraction

Document: docs/plans/2025-12-20-contentservice-refactor-phase1-implementation.md (v1.6) Reviewer: Claude (Opus 4.5) Date: 2025-12-21 Review Type: Strict Tactical Implementation Review

1. Overall Assessment

Strengths

Well-structured plan with clear task decomposition, version history, and 5-iteration review cycle
Comprehensive issue resolution - previous reviews addressed N+1 queries, lock ordering, scope management, and thread safety
Strong documentation of preconditions for internal methods (SaveLocked, DeleteLocked, GetPagedDescendantsLocked)
Baseline-enforced regression testing with configurable thresholds and CI-friendly environment variables
Production-ready versioning policy with 2-major-version deprecation periods and additive-only interface changes
Consistent lock acquisition patterns for most operations (ContentTree, ContentTypes, Languages)

Major Concerns

Delete operation has inconsistent scope.Complete() behavior on cancellation compared to batch Save
Sync-over-async pattern in Audit() still poses deadlock/thread exhaustion risk despite ConfigureAwait(false)
StaticServiceProvider usage for obsolete constructors introduces untestable code paths (explicitly discouraged in codebase)
Delete acquires lock AFTER notification - creates race window (differs from Save's lock-first pattern)

2. Critical Issues (P0)

2.1 Delete Cancellation Calls `scope.Complete()` - Inconsistent with Save

Location: Task 3, Delete method (lines 1308-1342 in plan)

Code in question:

// Delete method (lines 1313-1319):
if (scope.Notifications.PublishCancelable(new ContentDeletingNotification(content, eventMessages)))
{
    scope.Complete();  // <-- Calls Complete on cancel
    return OperationResult.Cancel(eventMessages);
}

Contrast with batch Save (lines 1268-1273):

if (scope.Notifications.PublishCancelable(savingNotification))
{
    return OperationResult.Cancel(eventMessages);  // No scope.Complete() - fixed in review 5
}

Verification: The original ContentService.Delete at line 2296-2297 does call scope.Complete() on cancel, so this maintains behavioral parity. However, this creates internal inconsistency within ContentCrudService.

Impact:

Inconsistent transaction behavior between operations
Potential confusion for developers maintaining the code
Scope.Complete() on cancel could theoretically commit partial state (though unlikely in cancel path)

Recommendation:

// Option A: Document the inconsistency (preserves parity)
if (scope.Notifications.PublishCancelable(new ContentDeletingNotification(content, eventMessages)))
{
    // NOTE: scope.Complete() called on cancel for behavioral parity with original ContentService.Delete.
    // This differs from Save operations which do NOT complete scope on cancel.
    // See: src/Umbraco.Core/Services/ContentService.cs line 2296
    scope.Complete();
    return OperationResult.Cancel(eventMessages);
}

// Option B: Align with Save (breaking change from original)
if (scope.Notifications.PublishCancelable(new ContentDeletingNotification(content, eventMessages)))
{
    _logger.LogInformation("Delete operation cancelled for content {ContentId} ({ContentName}) by notification handler",
        content.Id, content.Name);
    return OperationResult.Cancel(eventMessages);  // No scope.Complete()
}

Required action: Choose Option A or B and update plan accordingly.

2.2 Sync-over-Async Deadlock Risk in `Audit()` Method

Location: Task 1, ContentServiceBase.cs (lines 151-163 in plan)

Code in question:

protected void Audit(AuditType type, int userId, int objectId, string? message = null, string? parameters = null)
{
    // Use ConfigureAwait(false) to avoid context capture and potential deadlocks
    Guid userKey = UserIdKeyResolver.GetAsync(userId).ConfigureAwait(false).GetAwaiter().GetResult();

    AuditService.AddAsync(
        type,
        userKey,
        objectId,
        UmbracoObjectTypes.Document.GetName(),
        message,
        parameters).ConfigureAwait(false).GetAwaiter().GetResult();
}

Impact:

Thread pool exhaustion: Each sync call blocks a thread waiting for async completion
Potential deadlocks: If underlying async operations have sync dependencies or limited concurrency resources
Performance degradation: Under high load, blocked threads accumulate
The TODO comment (line 149-150) acknowledges this but provides no timeline or mitigation

Recommendation:

Add obsolete warning to sync method:

/// <summary>
/// Records an audit entry for a content operation (synchronous).
/// </summary>
/// <remarks>
/// <para><strong>Warning:</strong> This method uses sync-over-async pattern which can cause
/// thread pool exhaustion under high load. Prefer <see cref="AuditAsync"/> for new code.</para>
/// <para>TODO: Replace with sync overloads when IAuditService.Add and IUserIdKeyResolver.Get are available.</para>
/// </remarks>
[Obsolete("Prefer AuditAsync for new code. Sync wrapper may cause thread pool exhaustion under high load.")]
protected void Audit(AuditType type, int userId, int objectId, string? message = null, string? parameters = null)

Track sync Audit() callers for future migration to AuditAsync()

Required action: Add [Obsolete] attribute with warning message.

2.3 StaticServiceProvider Makes Obsolete Constructors Untestable

Location: Task 5, obsolete constructor specification (lines 1803-1807 in plan)

Code in question:

// NEW: Lazy resolution of IContentCrudService
_crudServiceLazy = new Lazy<IContentCrudService>(() =>
    StaticServiceProvider.Instance.GetRequiredService<IContentCrudService>(),
    LazyThreadSafetyMode.ExecutionAndPublication);

StaticServiceProvider documentation (verified in codebase):

/// <remarks>
///     Keep in mind, every time this is used, the code becomes basically untestable.
/// </remarks>
[EditorBrowsable(EditorBrowsableState.Never)]
public static class StaticServiceProvider

Impact:

Unit tests cannot mock IContentCrudService when using obsolete constructors
Migration scenarios from old constructor to new constructor are untestable at unit level
Goes against the codebase's own documented guidance

Recommendation:

Add XML documentation warning:

/// <remarks>
/// <para><strong>Testing limitation:</strong> This constructor uses StaticServiceProvider for
/// IContentCrudService resolution, making it untestable at unit level. Integration tests
/// should verify CRUD delegation when using this constructor.</para>
/// </remarks>
[Obsolete("Use the non-obsolete constructor instead. Scheduled removal in v19.")]
public ContentService(...)

Add integration test verifying obsolete constructor behavior:

/// <summary>
/// Verifies obsolete constructor correctly delegates to IContentCrudService via StaticServiceProvider.
/// Required because obsolete constructor is not unit-testable due to StaticServiceProvider usage.
/// </summary>
[Test]
public void ObsoleteConstructor_DelegatesToContentCrudService()
{
    // Integration test using actual DI container
    var contentService = GetRequiredService<IContentService>();
    var content = contentService.Create("Test", -1, ContentType.Alias);

    Assert.That(content, Is.Not.Null);
    Assert.That(content.Name, Is.EqualTo("Test"));
}

Required action: Add documentation warning and integration test.

2.4 Delete Acquires Lock AFTER Notification - Race Window

Location: Task 3, Delete method (lines 1308-1322 in plan)

Code in question:

using (ICoreScope scope = ScopeProvider.CreateCoreScope())
{
    // Notification sent BEFORE lock acquired
    if (scope.Notifications.PublishCancelable(new ContentDeletingNotification(content, eventMessages)))
    {
        scope.Complete();
        return OperationResult.Cancel(eventMessages);
    }

    // Lock acquired AFTER notification
    scope.WriteLock(Constants.Locks.ContentTree);

    // ... deletion proceeds
}

Contrast with Save (post-review 4):

using (ICoreScope scope = ScopeProvider.CreateCoreScope())
{
    // Lock acquired BEFORE notification
    scope.WriteLock(Constants.Locks.ContentTree);
    scope.ReadLock(Constants.Locks.Languages);

    // Notification sent AFTER lock acquired
    if (scope.Notifications.PublishCancelable(savingNotification))
    {
        return OperationResult.Cancel(eventMessages);
    }

Verification: Original ContentService.Delete (line 2294-2300) follows the same pattern - notification before lock. This maintains behavioral parity.

Impact:

Race window between notification and lock: content could be modified/deleted by another thread
Notification handlers see potentially stale content state
Inconsistent pattern with Save operations (though matches original Delete)

Recommendation: Document this as intentional for parity:

public OperationResult Delete(IContent content, int userId = Constants.Security.SuperUserId)
{
    EventMessages eventMessages = EventMessagesFactory.Get();

    using (ICoreScope scope = ScopeProvider.CreateCoreScope())
    {
        // NOTE: Notification sent BEFORE lock for behavioral parity with original ContentService.Delete.
        // This differs from Save operations which acquire lock first.
        // Race window exists between notification and lock - accepted for backward compatibility.
        // See: src/Umbraco.Core/Services/ContentService.cs line 2294
        if (scope.Notifications.PublishCancelable(new ContentDeletingNotification(content, eventMessages)))

Required action: Add comment documenting intentional parity.

3. High Priority Issues (P1)

3.1 Missing `RequireBaseline` CI Documentation

Location: Task 6, benchmark enforcement

Problem: BENCHMARK_REQUIRE_BASELINE environment variable is implemented (line 1893) but not documented in CI setup instructions. Benchmarks will silently skip regression checks if baseline file is missing.

Recommendation: Add to Task 6 Step 5 commit message:

CI Configuration:
- Set BENCHMARK_REQUIRE_BASELINE=true in CI to fail on missing baselines
- Set BENCHMARK_REGRESSION_THRESHOLD=20 (default) or custom percentage

3.2 GetAncestors Warning Log False Positive Edge Case

Location: Task 3, GetAncestors (lines 1062-1067 in plan)

Code in question:

// Log warning if path appears malformed (expected ancestors but found none)
if (ancestorIds.Length == 0 && content.Level > 1)
{
    _logger.LogWarning(
        "Malformed path '{Path}' for content {ContentId} at level {Level} - expected {ExpectedCount} ancestors but parsed {ActualCount}",
        content.Path, content.Id, content.Level, content.Level - 1, ancestorIds.Length);
}

Problem: Edge case at Level 2 with path "-1,{selfId}" - content directly under root has 0 ancestors (root is skipped), but Level 2 implies 1 expected ancestor. Warning would incorrectly fire.

Recommendation: Adjust expected count calculation:

// Expected ancestors = Level - 1 (levels above) - 1 (root is skipped in parsing)
var expectedAncestorCount = content.Level - 2;  // Level 2 expects 0, Level 3 expects 1, etc.
if (ancestorIds.Length < expectedAncestorCount && expectedAncestorCount > 0)
{
    _logger.LogWarning(
        "Malformed path '{Path}' for content {ContentId} at level {Level} - expected {ExpectedCount} ancestors but parsed {ActualCount}",
        content.Path, content.Id, content.Level, expectedAncestorCount, ancestorIds.Length);
}

3.3 Unit Test Mock May Not Exercise Intended Code Path

Location: Task 3, unit test Save_WithVariantContent_CallsLanguageRepository (lines 718-751)

Concern: The test mocks _languageRepository.GetMany() to return empty list, but the verification asserts it's called Times.Once. Need to verify the test data setup actually triggers the variant save path.

Recommendation: Add explicit assertion that variant path was taken:

// Verify we actually entered the variant code path
Assert.That(content.Object.ContentType.VariesByCulture(), Is.True,
    "Test setup error: ContentType should vary by culture");
Assert.That(cultureInfoDict.Object.Values.Any(x => x.IsDirty()), Is.True,
    "Test setup error: Should have dirty culture infos");

4. Medium Priority Issues (P2)

4.1 Benchmark Threshold Too Tight for Low-Millisecond Operations

Location: Task 6, threshold table

Problem: 20% threshold on 7ms baseline allows only 1.4ms variance (8.4ms max). System timing jitter can easily exceed this.

Current thresholds:

Benchmark	Baseline	20% Max	Variance Allowed
`Save_SingleItem`	7ms	8.4ms	1.4ms
`GetById_Single`	8ms	9.6ms	1.6ms

Recommendation: Consider two-tier threshold or absolute floor:

protected void AssertNoRegression(string name, long elapsedMs, int itemCount, double thresholdPercent = -1)
{
    var effectiveThreshold = thresholdPercent < 0 ? RegressionThreshold : thresholdPercent;

    if (Baseline.TryGetValue(name, out var baselineResult))
    {
        // Two-tier: For fast operations, allow at least 5ms absolute variance
        var absoluteFloor = 5.0;
        var percentageAllowance = baselineResult.ElapsedMs * (effectiveThreshold / 100);
        var maxAllowed = baselineResult.ElapsedMs + Math.Max(absoluteFloor, percentageAllowance);
        // ...
    }
}

4.2 Missing `[Category("Benchmark")]` Attribute in Plan

Location: Task 6, benchmark test examples

Problem: Plan shows [LongRunning] attribute but filter commands use Category=Benchmark. Need to ensure consistency.

Recommendation: Verify benchmark tests have both attributes:

[Test]
[Category("Benchmark")]
[LongRunning]
public void Benchmark_Save_SingleItem()

4.3 Plan Summary Method Count Minor Discrepancy

Location: Summary section (line 2166-2171)

Current: Lists "23 public" methods with breakdown totaling 23.

Actual count from interface (lines 286-491):

Create: 6 methods
Read: 7 methods (GetById x2, GetByIds x2, GetRootContent, GetParent x2)
Read Tree: 7 methods (GetAncestors x2, GetPagedChildren, GetPagedDescendants, HasChildren, Exists x2)
Save: 2 methods
Delete: 1 method
Total: 23 methods

Status: Count is correct. No action needed.

5. Questions for Clarification

#	Question	Impact	Context
1	Should Delete's `scope.Complete()` on cancellation be preserved (parity) or removed (consistency with Save)?	P0	Issue 2.1
2	What's the timeline for sync `IAuditService.Add` / `IUserIdKeyResolver.Get` to eliminate sync-over-async?	P0	Issue 2.2
3	Are existing `ContentService` integration tests sufficient to cover `ContentCrudService` via delegation?	P1	Test coverage
4	Should Delete lock timing be changed to lock-first (like Save) or kept for parity?	P0	Issue 2.4

6. Implementation Checklist Additions

Add to existing checklist in plan:

From Critical Review 6 (Required)

Issue 2.1: Document or fix Delete's scope.Complete() on cancellation
Issue 2.2: Add [Obsolete] to sync Audit() method with deadlock warning
Issue 2.3: Add XML docs warning about obsolete constructor untestability
Issue 2.3: Add integration test for obsolete constructor CRUD delegation
Issue 2.4: Add comment documenting Delete lock timing is intentional for parity

From Critical Review 6 (Recommended)

Issue 3.1: Document BENCHMARK_REQUIRE_BASELINE in CI setup
Issue 3.2: Fix GetAncestors expected ancestor count calculation
Issue 3.3: Add setup validation assertions to variant content unit test
Issue 4.1: Consider absolute threshold floor for fast benchmarks
Issue 4.2: Verify [Category("Benchmark")] on all benchmark tests

7. Final Recommendation

Approve with Changes

The plan has undergone thorough review and is well-structured. The remaining issues are addressable:

Required before implementation (P0):

Document Delete's scope.Complete() behavior (issue 2.1) - choose parity or consistency
Add [Obsolete] warning to sync Audit() method (issue 2.2)
Document obsolete constructor untestability and add integration test (issue 2.3)
Document Delete lock timing rationale (issue 2.4)

Strongly recommended (P1): 5. Document BENCHMARK_REQUIRE_BASELINE for CI (issue 3.1) 6. Fix GetAncestors warning log edge case (issue 3.2)

Optional improvements (P2): 7. Consider absolute threshold floor for fast benchmarks (issue 4.1)

Appendix: Verification Commands Used

# Verified StaticServiceProvider exists and is documented as untestable
grep -r "StaticServiceProvider" src/Umbraco.Core/

# Verified original ContentService.Delete behavior
grep -A 30 "OperationResult Delete" src/Umbraco.Core/Services/ContentService.cs

# Verified baseline file exists and has expected structure
cat docs/plans/baseline-phase0.json

Review Status: Complete Next Step: Address P0 issues and update plan to v1.7, then proceed to implementation

17 KiB Raw Blame History

Critical Implementation Review #6: Phase 1 ContentService CRUD Extraction

1. Overall Assessment

Strengths

Major Concerns

2. Critical Issues (P0)

2.1 Delete Cancellation Calls scope.Complete() - Inconsistent with Save

2.2 Sync-over-Async Deadlock Risk in Audit() Method

2.3 StaticServiceProvider Makes Obsolete Constructors Untestable

2.4 Delete Acquires Lock AFTER Notification - Race Window

3. High Priority Issues (P1)

3.1 Missing RequireBaseline CI Documentation

3.2 GetAncestors Warning Log False Positive Edge Case

3.3 Unit Test Mock May Not Exercise Intended Code Path

4. Medium Priority Issues (P2)

4.1 Benchmark Threshold Too Tight for Low-Millisecond Operations

4.2 Missing [Category("Benchmark")] Attribute in Plan

4.3 Plan Summary Method Count Minor Discrepancy

5. Questions for Clarification

6. Implementation Checklist Additions

From Critical Review 6 (Required)

From Critical Review 6 (Recommended)

7. Final Recommendation

Approve with Changes

Appendix: Verification Commands Used

17 KiB

Raw Blame History

2.1 Delete Cancellation Calls `scope.Complete()` - Inconsistent with Save

2.2 Sync-over-Async Deadlock Risk in `Audit()` Method

3.1 Missing `RequireBaseline` CI Documentation

4.2 Missing `[Category("Benchmark")]` Attribute in Plan