Handling Missing Disk Stores
Use the gfxd list-missing-disk-stores command to find disk stores with the latest persisted data. Use gfxd revoke-missing-disk-stores to stop waiting for the data when you cannot bring it online.
Why Disk Stores Go Missing
- Disk store file corruption. You can check on this by validating the disk store.
- Incorrect distributed system configuration for the member
- Network partitioning
- Drive failure
List Missing Disk Stores
The gfxd list-missing-disk-stores command lists all disk stores with most recent data that are being waited on by other members.
For replicated tables, this command only lists missing members that are preventing other members from starting up. For partitioned tables, this command also lists any offline data stores, even when other data stores for the table are online, because their offline status may be causing PartitionOfflineExceptions in cache operations or preventing the system from satisfying redundancy.
gfxd list-missing-disk-stores Connecting to distributed system: mcast=/18.104.22.168:12348 DiskStore at straw.gemstone.com /export/straw3/users/jpearson/testGemFire/hostB/DS1
Revoke Missing Disk Stores
- Disk stores that have the most recent copy of data for one or more tables or table buckets.
- Disk stores that are unrecoverable, such as when you have deleted them, or their files are corrupted or on a disk that has had a catastrophic failure.
When you cannot bring the latest persisted copy online, use the revoke command to tell the other members to stop waiting for it. Once the store is revoked, the system finds the remaining most recent copy of data and uses that.
Use the gfxd list-missing-disk-stores command to identify the disk store you need to revoke. The revoke command takes the host and directory in input, as listed by that command.
gfxd list-missing-disk-stores Connecting to distributed system: mcast=/22.214.171.124:12348 DiskStore at straw.gemstone.com /export/straw3/users/jpearson/testGemFire/hostB/DS1 gfxd revoke-missing-disk-store straw.gemstone.com /export/straw3/users/jpearson/testGemFire/hostB/DS1 Connecting to distributed system: mcast=/126.96.36.199:12348 revocation was successful ...