Hardware Management/FMFM
Jump to navigation
Jump to search
Welcome to the OCP Fleetscale Memory Fault Management (FMFM) WIKI
Fleetscale Memory Fault Management is a Worksteam within the Hardware Management Project.
Leadership
Scope
The FMFM is a workstream about standardization of Fleetscale Memory Fault Management
- Proposed topics:
- Standardize vendor agnostic architecture for memory error handling
- Modularization of inputs from different hardware vendors
- APIs and connections between different modules from different vendors.
- Define the output of each module (failure cause, health information, RAS actions, etc.)
- Standardize memory error telemetry
- Format content for better fleet scale RAS management
- Troubleshooting, FRU replacement policies, etc.
- Coordinate with the broader OCP group to make sure there is a path for this general architecture
Get Involved
Subproject Meets Biweekly on Tuesday from 7-9 am PST
- - Link to the FMFM Calendar
- - Link to the Meeting
- - You can also dial in using your phone : United States: +1 (646) 749-3112 Access Code: 454-746-381
Mailing List
Participate in the discussion:
- - FMFM on OCP Groups.io: FMFM Group Link
- - Subscribe to mailing list
- - Post to mailing list
Review and provide Feedback
Documents
Link to Fleetscale Memory Fault Management (FMFM) Workstream Proposal on Google Drive
Link to Fleetscale Memory Fault Management (FMFM) Framework Requirements Google Drive