Skip to main content

Salesforce.com's Oracle Grid Database cluster crash!




This must have been a lot of sweating nights for the Salesforce folks, one of Oracle's customers!
But believe me , I know how things go.
  • Database is grid control, who wants a standby.
  • 10g is easy, who wants a backup dba. Well can we fire the dba too? Sysadmin's can do it, no?
  • Everything is on SAN, someone will restore it.
  • Cluster crash? Never heard of it, funny the series that I'm writing (part V) which spoke about understanding your architecture and proactively work towards it's continuity.
  • MTTR, what is that?
  • MTTR, we have it set to 3 mins! (Hey have you ever tested it? Anywhere! Somewhere!)
  • Backup restore, have you tested it?
  • Do you have a valid test environment?
  • Do you have anything that looks like a test environment? Anything? Something?
I know the management team there is looking hard for someone to blame. I just hope the poor sysadmin or dba isn't the only one who will take the heat! Management ought to stand up to take it's responsibility as well.

We need to understand together
  • disks fail
  • clusters crash (with all kinds of errors which need desperate attention all the time!)
  • backups fail
  • restores fail
  • It's always happening when you're asleep
  • It happens most of the times in weekends
Technologies like grid computing or RAC etc are thoroughly tested technologies. What we do need to realize is that we cannot just rely on technologies but also have a proper plan for business continuity!

And I don't think hatred had anything to do with it, or did it?

Comments

Popular posts from this blog

Security: VMware Workstation 6 vulnerability

vulnerable software: VMware Workstation 6.0 for Windows, possible some other VMware products as well type of vulnerability: DoS, potential privilege escalation I found a vulnerability in VMware Workstation 6.0 which allows an unprivileged user in the host OS to crash the system and potentially run arbitrary code with kernel privileges. The issue is in the vmstor-60 driver, which is supposed to mount VMware images within the host OS. When sending the IOCTL code FsSetVoleInformation with subcode FsSetFileInformation with a large buffer and underreporting its size to at max 1024 bytes, it will underrun and potentially execute arbitrary code. Security focus

OS Virtualization comparison: Parallels' Virtuozzo vs the rest

Virtuozzo's main differentiators versus hypervisors center on overhead, virtualization flexibility, administration and cost. Virtuozzo requires significantly less overhead than hypervisor solutions, generally in the range of 1% to 5% compared with 7% to 25% for most hypervisors, leaving more of the system available to run user workloads. Customers can also virtualize a wider range of applications using Virtuozzo, including transactional databases, which often suffer from performance problems when used with hypervisors. On the administration side, customers need to manage, maintain and secure just a single OS instance, while the hypervisor model requires customers to manage many OS instances. Of course, the hypervisor vendors have worked hard to automate much of this process, but it still requires more effort to manage and maintain multiple operating systems than a single instance. Finally, OS virtualization with Virtuozzo has a lower list price than the leading hypervisor for comme...