Hardware Management/FMFM: Difference between revisions
Jump to navigation
Jump to search
Kevin.kifer (talk | contribs) |
|||
(8 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
==Welcome to the OCP Fleetscale Memory Fault Management (FMFM) WIKI== | ==Welcome to the OCP Fleetscale Memory Fault Management (FMFM) WIKI== | ||
Fleetscale Memory Fault Management is a Worksteam within the Hardware Management Project. | Fleetscale Memory Fault Management is a Worksteam within the Hardware Management Project. | ||
[https://www.opencompute.org/wiki/Hardware_Management Hardware Management Project] | [https://www.opencompute.org/wiki/Hardware_Management Hardware Management Project] | ||
==Leadership== | |||
* [mailto:shen.zhou@intel.com Shen Zhou] | * [mailto:shen.zhou@intel.com Shen Zhou] | ||
* [mailto:acwalton@google.com Drew Walton] | * [mailto:acwalton@google.com Drew Walton] | ||
* [mailto:yogesh.varmau@intel.com Yogesh Varma] | * [mailto:yogesh.varmau@intel.com Yogesh Varma] | ||
==Scope== | |||
The FMFM is a workstream about standardization of Fleetscale Memory Fault Management | The FMFM is a workstream about standardization of Fleetscale Memory Fault Management | ||
*Proposed topics: | *Proposed topics: | ||
<ol> | <ol> | ||
Line 32: | Line 35: | ||
==Get Involved== | ==Get Involved== | ||
===Subproject Meets Biweekly on Tuesday from 7:10-8 am PST === | |||
* [https://ocp-all.groups.io/g/OCP-HWMgt-FMFM/calendar Link to the FMFM Calendar] | |||
* [https://global.gotomeeting.com/join/454746381 Link to the Meeting] | |||
* You can also dial in using your phone : United States: +1 (646) 749-3112 Access Code: 454-746-381 | |||
===Mailing List === | |||
Participate in the discussion: | Participate in the discussion: | ||
* FMFM on OCP Groups.io: [https://ocp-all.groups.io/g/OCP-HWMgt-FMFM FMFM Group Link] | |||
* [mailto:OCP-HWMgt-FMFM+subscribe@OCP-All.groups.io Subscribe to mailing list] | |||
* [mailto:OCP-HWMgt-FMFM@OCP-All.groups.io Post to mailing list] | |||
=== | ===Documents=== | ||
* [https://docs.google.com/document/d/1xmDmlXKMluo4WzuhGYWwChA8rdvjli96Ev2HO7tABM8/edit#heading=h.s2v3a52onncc Link to Fleetscale Memory Fault Management (FMFM) Workstream Proposal] | |||
* [https://docs.google.com/document/d/1LtEmytHLozJ8sBAdshO5KYtQgE-PwB6os4NLVrmXETM/edit?usp=sharing Link to Fleetscale Memory Fault Management (FMFM) Framework Requirements] | |||
=== | ===Past Presentation Recordings=== | ||
[https:// | * [https://www.youtube.com/watch?v=ZeqgPE9IC_o&list=PLAG-eekRQBShiPHyTkmsO_VbkHtmsmnba&index=3 Link to FMFM Talk at 2023 OCP Global Summit] | ||
=== | ===FMFM Weekly Call Recordings=== | ||
: | * [https://www.youtube.com/watch?v=B5mrVIVD29I Jul 16, 2024] | ||
: | * [https://www.youtube.com/watch?v=x5MDQ3ifCEA Jun 18, 2024] | ||
:- [https:// | * [https://www.youtube.com/watch?v=HENCfonFnIg Jun 04, 2024] | ||
: | * [https://www.youtube.com/watch?v=HwedI9K_Ask May 21, 2024] | ||
: | * [https://www.youtube.com/watch?v=otOS3iJZBQg May 07, 2024] | ||
* [https://www.youtube.com/watch?v=SQw1UNmVu-A Apr 23, 2024] | |||
* [https://www.youtube.com/watch?v=pSO5I1uWC9A Apr 09, 2024] | |||
* [https://www.youtube.com/watch?v=D71VSdtm7Sg Mar 26, 2024] | |||
* [https://www.youtube.com/watch?v=_HC81xZJpQ0 Mar 12, 2024] | |||
* [https://www.youtube.com/watch?v=vziNL9RSrOQ Feb 27, 2024] | |||
* [https://www.youtube.com/watch?v=tr8AXCgMF-8 Feb 13, 2024] | |||
* [https://www.youtube.com/watch?v=IzsxR2O3ioU Jan 30, 2024] | |||
* [https://www.youtube.com/watch?v=sk3rIiS_o5U Jan 16, 2024] | |||
* [https://www.youtube.com/watch?v=GNeNMnr87Qg Jan 2, 2024] | |||
* [https://www.youtube.com/watch?v=vm4c8FhY6N8 Dec 5, 2023] | |||
* [https://www.youtube.com/watch?v=Z1Pwi5TpwRA Nov 21, 2023] | |||
* [https://www.youtube.com/watch?v=hr1pq6WfrcA Nov 7, 2023] | |||
* [https://www.youtube.com/watch?v=BV_WZxRmGUw Oct 24, 2023] | |||
* [https://www.youtube.com/watch?v=XtLSclRThIM Oct 10, 2023] | |||
* [https://www.youtube.com/watch?v=5aQh4Ke__b8 Sep 26, 2023] | |||
* [https://www.youtube.com/watch?v=1K0MtdNSd_Q Sep 12, 2023] | |||
* [https://www.youtube.com/watch?v=-DfMNSUB6e4 Aug 29, 2023] |
Latest revision as of 17:42, 16 July 2024
Welcome to the OCP Fleetscale Memory Fault Management (FMFM) WIKI[edit]
Fleetscale Memory Fault Management is a Worksteam within the Hardware Management Project.
Leadership[edit]
Scope[edit]
The FMFM is a workstream about standardization of Fleetscale Memory Fault Management
- Proposed topics:
- Standardize vendor agnostic architecture for memory error handling
- Modularization of inputs from different hardware vendors
- APIs and connections between different modules from different vendors.
- Define the output of each module (failure cause, health information, RAS actions, etc.)
- Standardize memory error telemetry
- Format content for better fleet scale RAS management
- Troubleshooting, FRU replacement policies, etc.
- Coordinate with the broader OCP group to make sure there is a path for this general architecture
Get Involved[edit]
Subproject Meets Biweekly on Tuesday from 7:10-8 am PST[edit]
- Link to the FMFM Calendar
- Link to the Meeting
- You can also dial in using your phone : United States: +1 (646) 749-3112 Access Code: 454-746-381
Mailing List[edit]
Participate in the discussion:
- FMFM on OCP Groups.io: FMFM Group Link
- Subscribe to mailing list
- Post to mailing list
Documents[edit]
- Link to Fleetscale Memory Fault Management (FMFM) Workstream Proposal
- Link to Fleetscale Memory Fault Management (FMFM) Framework Requirements