Skip to content

Review and integrate OpenShift AIOps Self-Healing Platform pattern documentation (PR #652) #653

@tosin2013

Description

@tosin2013

Overview

PR #652 introduces comprehensive documentation for the OpenShift AIOps Self-Healing Platform validated pattern - an AI-powered, GitOps-deployed solution providing intelligent, automated incident response for OpenShift clusters.

PR: #652
Author: @tosin2013
Size: XXL (3,542 additions, 0 deletions)
Status: Open - awaiting review

Documentation Scope

The PR adds the following documentation components:

Core Documentation

  • Pattern overview and introduction
  • Getting started guide with deployment instructions
  • Architecture documentation covering:
    • Coordination Engine
    • MCP Server integration
    • KServe model serving

Deployment Guides

  • High Availability (HA) cluster deployment
  • Single Node OpenShift (SNO) deployment
  • Cluster sizing recommendations (HA: 7 nodes, ~57 cores, ~185GB RAM)

Configuration & Customization

  • Extensive customization guide (1,600+ lines) with real-world production examples:
    • Multi-model inference serving (anomaly detection, predictive analytics)
    • Mixed storage configurations (CephFS + gp3-csi)
    • Platform configuration tuning via ConfigMap
    • Complete self-healing workflow
  • Storage configuration patterns (ODF, CephFS, cloud storage)

Development Resources

  • Jupyter notebook catalog (33 notebooks)
  • Development workflow documentation
  • Workshop materials and references

Review Checklist

Documentation team should verify:

  • Hugo build: Confirm site builds successfully without errors (currently builds 483 pages)
  • AsciiDoc formatting: No parsing errors, proper use of modular doc structure
  • Code examples: All YAML/Python code examples are properly formatted and functional
  • Cross-references: All internal links and cross-references are valid
  • Real-world examples: Verify examples align with current cluster deployments
  • External links: Check GitHub pattern repo and workshop links are correct and stable
  • Technical accuracy: Validate cluster sizing and storage configuration recommendations
  • Style consistency: Ensure documentation follows existing pattern documentation style
  • Accessibility: Review AsciiDoc formatting for accessibility compliance

Action Items

  1. Initial Review: Documentation team to review structure and formatting
  2. Technical Review: SMEs to verify technical accuracy of architecture, sizing, and configuration
  3. QE Review: Quality Engineering to test examples and verify build process
  4. Final Approval: Obtain approval from reviewers (mention: Andrew Beekhof, Drew Minnear if appropriate)
  5. Squash commits: Ensure all commits are squashed before merge
  6. Merge: Complete integration into main documentation

Success Criteria

  • All documentation builds without errors
  • Real-world examples validated against current deployment
  • Reviewers provide approval
  • No broken cross-references
  • Consistent with existing documentation style and guidelines

Additional Information

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions