IN THIS ARTICLE
This article outlines 2 and 3 drive failure protection in Qumulo Core.
REQUIREMENTS
- Cluster with 4 or more nodes running Qumulo Core for 2 Drive Failure Protection
- Cluster with 5 or more nodes running Qumulo Core 2.7.10 and above for 3 Drive Failure Protection
NOTE: The recommended protection level (2 or 3 drive) is selected by default at cluster creation, based on the cluster size and node type. For more information, see the Create a Qumulo Cluster with 2.7.10 and above article.
2 DRIVE FAILURE PROTECTION
System State | Severity | Color | UI Message | Protection State |
Nodes & Drives are fully operational | None | Green | Cluster is balanced and protected for up to 2 drive failures | Data is Protected and Balanced |
1 drive failure, data is reprotecting | Medium | Yellow | A drive has failed. The cluster can sustain [x] additional drive failure[s], but you should replace the failed drive. | Data is Reprotecting |
1 drive failure, data is protected | Medium | Yellow | A drive has failed. The cluster can sustain [x] additional drive failure[s], but you should replace the failed drive. | Data is Protected and Balanced |
2 drive failures, data is reprotecting | High | Red | 2 drives have failed. This is the maximum your cluster can tolerate without data loss. You should replace the failed drives ASAP. | Data is Reprotecting |
2 drive failures, data is protected | Medium | Yellow | 2 drives have failed. This is the maximum your cluster can tolerate without data loss. You should replace the failed drives ASAP. | Data is Protected and Balanced |
2+ drive failures | High | Red | [x] drives have failed. This is the maximum your cluster can tolerate without data loss. You should replace the failed drives ASAP. | Data is available, but we are unable to run reprotect |
1 node offline | Medium | Red | Any additional failures could result in data loss. All nodes must be online before data can be reprotected. | Data is Protected |
• 1 drive failure & 1 node offline • 2+ nodes offline • 2+ drive failures & any node offline | Very High | Red | We are unable to communicate with your cluster at this time. | Emergency situation |
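The drive-only rows of the 2 drive failure protection table can be read as a simple lookup. The sketch below is illustrative only: the `Status` type, the `TWO_DRIVE_STATUS` mapping, and the `two_drive_status` helper are hypothetical names and not part of Qumulo Core or its API. It assumes all nodes are online and covers only the states with 0, 1, or 2 failed drives.

```python
from typing import NamedTuple


class Status(NamedTuple):
    severity: str
    color: str
    protection_state: str


# Hypothetical encoding: key = (failed_drives, reprotecting), all nodes online.
# Values are transcribed from the 2 drive failure protection table.
TWO_DRIVE_STATUS = {
    (0, False): Status("None", "Green", "Data is Protected and Balanced"),
    (1, True): Status("Medium", "Yellow", "Data is Reprotecting"),
    (1, False): Status("Medium", "Yellow", "Data is Protected and Balanced"),
    (2, True): Status("High", "Red", "Data is Reprotecting"),
    (2, False): Status("Medium", "Yellow", "Data is Protected and Balanced"),
}


def two_drive_status(failed_drives: int, reprotecting: bool) -> Status:
    """Return the table row for a drive-only failure state (illustrative helper)."""
    try:
        return TWO_DRIVE_STATUS[(failed_drives, reprotecting)]
    except KeyError:
        # Node-offline rows and the "2+ drive failures" row are not modeled here.
        raise ValueError("state not covered by this sketch") from None


print(two_drive_status(2, True))
```

Note that the severity drops from High to Medium once reprotection completes with 2 failed drives, which is why the lookup key includes the reprotecting flag rather than the drive count alone.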
3 DRIVE FAILURE PROTECTION
System State | Severity | Color | UI Message | Protection State |
Nodes & Drives are fully operational | None | Green | Cluster is balanced and protected for up to 3 drive failures | Data is Protected and Balanced |
1 drive failure, data is reprotecting | Medium | Yellow | A drive has failed. The cluster can sustain [x] additional drive failure[s], but you should replace the failed drive. | Data is Reprotecting |
1 drive failure, data is protected | Medium | Yellow | A drive has failed. The cluster can sustain [x] additional drive failure[s], but you should replace the failed drive. | Data is Protected and Balanced |
2 drive failures, data is reprotecting | Medium | Yellow | 2 drives have failed. The cluster can sustain [x] additional drive failure[s] but you should replace the failed drives. | Data is Reprotecting |
2 drive failures, data is protected | Medium | Yellow | 2 drives have failed. The cluster can sustain [x] additional drive failure[s] but you should replace the failed drives. | Data is Protected and Balanced |
3 drive failures, data is reprotecting | High | Red | 3 drives have failed. This is the maximum your cluster can tolerate without data loss. You should replace the failed drives ASAP. | Data is Reprotecting |
3 drive failures, data is protected | Medium | Yellow | 3 drives have failed. The cluster can sustain [x] additional drive failure[s] but you should replace the failed drives ASAP. | Data is Protected and Balanced |
3+ drive failures, data is reprotecting | High | Red | [x] drives have failed. This is the maximum your cluster can tolerate without data loss. You should replace the failed drives ASAP. | Data is available, but we are unable to run reprotect |
1 node offline | Medium | Red | Data is protected from 1 additional drive failure or 0 node failures. | Data is Protected |
1 drive failure and 1 node offline | High | Red | Any additional failures could result in data loss. All nodes must be online before data can be reprotected. | Data is Protected |
| Very High | Red | We are unable to communicate with your cluster at this time. | Emergency situation |
RESOLUTION
You should now have an overall understanding of 2 and 3 drive failure protection in Qumulo Core.
ADDITIONAL RESOURCES
Create a Qumulo Cluster with 2.7.10 and above
Qumulo Care Proactive Monitoring
Video: HDD Field Replacement Unit