Active Incident

Updated a few seconds ago

Partial Service Disruption - MetricsPartial Service Disruption

Incident Status

Partial Service Disruption

Components

CoreWeave Services

Locations

US-ORD1, US-LAS1, US-LGA1



April 26, 2024 6:56PM UTC
[Identified] We are implementing additional remediation efforts to improve the availability of our metrics as previously described in: https://status.coreweave.com/pages/incident/5e126e998f2f032e1f8f0f4b/661fda9cea7df3052fa28181 As we continue our efforts, you may see periodic interruptions in services such as Virtual Server UI, Cloud Metrics, and the Availability API. We apologize for any inconvenience this may cause and thank you for your patience while we work to resolve the matter in a timely manner.

April 26, 2024 7:09PM UTC
[Monitoring] These remediation efforts will continue over the next few weeks as we work to ensure that our metrics are stabilized. Thank you again for your patience.

CoreWeave Cloud




Operational

CoreWeave Inference




Operational

Concierge Render




Operational

CoreWeave Shared Filesystem Volumes




Operational

CoreWeave Object Storage




Operational

CoreWeave Block Storage Volumes




Operational

CoreWeave Services




Partial Service Disruption

CoreWeave Premium Shared Filesystem Volumes




Operational

0

Upcoming Maintenances

6

Incidents Last 30 Days

9

Maintenances Last 30 Days

External Services

History (Last 7 days)

Description

At 2:00 PM EST we will be performing necessary control-plane maintenance to expand load capacity. This is not expected to cause disruption.


Components

CoreWeave Cloud, CoreWeave Inference, Concierge Render, CoreWeave Shared Filesystem Volumes, CoreWeave Object Storage, CoreWeave Block Storage Volumes, CoreWeave Services, CoreWeave Premium Shared Filesystem Volumes


Locations

US-ORD1, US-LAS1, US-LGA1


Schedule

April 26, 2024 6:00PM - April 26, 2024 8:00PM UTC



April 26, 2024 6:15PM UTC
[Update] Starting roll of control plane api servers

April 26, 2024 6:36PM UTC
[Update] Maintenance complete

Incident Status

Partial Service Disruption


Components

CoreWeave Services


Locations

US-ORD1, US-LAS1, US-LGA1




April 17, 2024 2:20PM UTC
[Monitoring] We are actively working to remediate an issue that may be impacting the availability of our metrics. During this time, you may see interruptions in services such as: Grafana, Cloud Metrics, and Availability API. We apologize for any inconvenience this may cause and thank you for your patience while we work to resolve the matter in a timely manner.

April 18, 2024 4:11PM UTC
[Monitoring] We are still working on our efforts towards remediation, and we anticipate completing our mitigation work within the next 24 hours. Thank you for your patience, and please reach out to [email protected] if you have additional questions.

April 19, 2024 4:35PM UTC
[Monitoring] We are continuing our efforts to remediate this issue. In the interim, you may continue to see partial disruptions to our metrics. Thank you again for your patience as we continue to work through this issue.

April 20, 2024 12:13AM UTC
[Monitoring] We are continuing our efforts towards remediation, and have made additional progress towards mitigating this issue. As mitigation continues, there may be periodic metrics interruptions, which we are actively working to resolve. Please reach out to [email protected] if you have additional questions or experience interruptions.

April 24, 2024 12:45AM UTC
[Resolved] We have completed our remediation efforts, and have been monitoring our metrics for the past few hours with further incident. We believe the issue to be resolved and in a stable state.