• MyOxy
  • Offices & Services
  • Alumni
  • Newsroom
  • Calendars

Occidental College

Our StoryAdmission & AidAcademicsLife at OxyLos AngelesOxy VoicesGivingGo Tigers!

Information Resources

  • Information Resources
  • Blog
  • So what is central storage and why are you doing so much maintenance because of it?

So what is central storage and why are you doing so much maintenance because of it?

March 15, 2012

Sun 7410 with 92 drives

There's a total of 102 disks crammed in there. Don't scream at them.

You're probably sick of getting emails from me telling you that my team and I have to shut everything down in order to work on our "central storage equipment" or some other technobabble. Centralized storage amounts to one or more highly specialized servers that have loads of hard drives and can share all of that space with other servers.  In Oxy's case, our centralized storage comes in the form of a Sun (now Oracle) 7410, which you can see to the left.  The 7410 is crammed in the middle, sandwiched between pairs of big boxes with lots of disks.  Those boxes are called J4400s and we used to only have two of them. On January 7th, we added the other two.  Or at least, we tried. We ran out of space back in October of 2011, which is never a good thing.  We needed more disks but it turned out that was easier said than done.  Still, towards the end of 2011, we secured two used J4400s and it seemed like it was a pretty straightforward process of hooking them up and enjoying the additional free space. There was a catch, though.  Because we were installing second-hand parts, we needed Oracle techs to come out and "recertify" everything so that we could be eligible for support and maintenance.  In addition to making sure everything was installed properly, they wanted us to install the latest software updates. So on January 7th, we began the upgrade process.  It has not gone smoothly. For the most part, we got hampered by the need to replace or add parts.  First, we needed to buy 4 extra disks.  The used J4400s we got came with 22 disks each and they needed 24.  Then, we needed to replace the controller cards on the 7410, which Oracle shipped out to us. Remember how I mentioned earlier that we needed to upgrade the software?  Well, we couldn't because 4 out of the now 102 disks attached to our 7410 were not functioning properly. At first, Oracle thought it was a problem with the cables.  We ordered new ones and swapped them out.  No dice.  Then Oracle figured we could shut everything down, unplug everything, then plug everything back in.  We did this.  No luck.  Then we did it again. Remember that software update we couldn't install?  Oracle told us we could force it to ignore the problem with the 4 disks and install anyway, but that it was a bad idea.  Until, apparently, it wasn't and we end up forcing the install anyway.  So now we have the latest software which made us eligible to receive all of that technical support that we had been availing ourselves of anyway. Finally, it came down to something called a SIM board.  What's a SIM board?  I don't know but there are 8 of them, 2 in each J4400, and 1 of them is faulty.  Oracle can narrow it down to 2 possible culprits.  They ship us a new one and wish us luck. Finally, on March 14th (happy pi day, by the way), we power down our 7410, replace one of the SIMs, then turn everything back on. And it's finally fixed! Oh wait, I replaced the wrong one.  Shut everything down again, replace the other SIM, then power everything back on. And it's finally fixed! I'm glossing over the details here but each time we shut down the 7410, it involves shutting down nearly 50 servers that rely on it for storage, along with the entire Oxy Virtual Computer.  We've had lots of practice at this, obviously, but this still takes about 30 minutes.  The 7410 itself takes about 15 minutes to start up, and then it takes another 30 minutes or so to start up those 50 servers and the Oxy Virtual Computer again. So there you have it.  That all happened and it required a lot of downtime and a lot of people were inconvenienced by all of the maintenance we had to schedule to try to fix this.  I'm sorry about this.  I always say that your patience and understanding is appreciated - yes, I say that all the time so I'm sure it's wearing thin but seriously, your patience and understanding is really truly sincerely appreciated.

Information Resources

  • Academic Commons Vision
  • Blog
    • Yesterday's Internet Outage
    • Using OxyConnect with Internet Explorer 8
    • So Is Our Anti-Virus Totally Broken?
    • Information Resources 2012 Fall Newsletter now available!
    • Deptprinters queues
    • Academic Commons Taskforce: Progress and Process
    • Academic Commons….the history
    • Building and Rebuilding
    • By the Book – The Evolution of the Library
    • Do you love or hate conversation view for Gmail?
    • Does private browsing work?
    • One Perspective on The Digital Scholarship Institute
    • Optimizing Resources
    • OxyScholar Feb '12 Stats
    • Mapping New Directions in Academic Technologies
    • Making (Art) History With the Help of the DSI
    • Enumerating Badness: The latest way Facebook conspires to destroy you
    • Google Calendar sharing changes
    • How do you pick when to do maintenance?
    • Is there really a safety in numbers on Facebook?
    • A “Site” For More Eyes
    • A new method for accessing your files from off campus
    • Why does my antivirus icon look different?
    • Worried about losing your phone?
    • Why will the Internet go down for everyone if you're just doing work in HSC?
    • yeah, that was an earthquake
    • Where We Go To Learn
    • What's the point of all this maintenance?
    • A brief history of computing at Occidental
    • A Darker Shade of Green: Saving Power in the Datacenter
    • 2x upgrade
    • 2011/12 Library Break Hours extended thanks to student feedback
    • "Takeaway" Lunch - Scholars Discuss Their Experiences at the 2010 DSI
    • Pharos Print Drivers
    • The Definition of Simplicity
    • The DSI, Then and Next
    • The Attention Economy: Calculating the “Cost” of Information Overload
    • What does an Internet problem look like?
    • What exactly is a NOS anyway?
    • Welcome
    • Using Oxyconnect's Appointments Feature
    • The Right Tech For Real Results
    • Update on the 2011/10/07 network outage
    • Print your reports as PDFs from Banner
    • So what is central storage and why are you doing so much maintenance because of it?
    • SSL is supposed to solve two problems but most people only care about one of them
    • The Academic Commons Emerges
    • Scheduled Maintenance for Saturday, April 14th
    • Release to Print update - Coming Soon to Res Halls
    • Scheduled Maintenance for 15 Oct 2011
    • Scheduled Maintenance for 17 Sep 2011
  • Hours
  • People
  • Employment
  • Info Center:

    (323) 259-2640


  • Technology Helpdesk:

    (323) 259-2880 helpdesk@oxy.edu


  • IR Operations Offices: (323) 259-2832


  • Information Resources VP/CIO: (323) 259-1451
Tweet

Occidental College

  • For Parents
  • Employment
  • Contact Us
  • Maps & Directions

1600 Campus Road Los Angeles, California 90041