| Hard Drive Failure |
|
|
|
|
I felt this was an excellent article for those who wish to expand their knowledge about what it is within a harddrive that fails. It also provides some insight and techniques on how data recovery is attempted. Please feel free to read through in the interest of expanding your knowledge. I will stress that although this article implies the simplicity of what we do it is not an easy task. The time to learn via trial and error is not when you've just lost your business data, your family memories etc. Hopefully I am providing information that gives hope on the possiblility of a succesful data recovery from your failing drive and not simply arming you with just enough information to be dangerous to your data. Brian Kinash - CEO Kinash Digital Media Inc., Lead Technician of Right IT Services
The following article is by Scott A. Moulton @ Forensic Strategy Services, LLC. Copyright © 2007, All rights reserved.
Data recovery is necessary when source material fails and where no good backup exists, either Physical or Logical. There are two types of data recovery in the standard basic sense. One type of data recovery is when there is damage to the media and the pre-existing data need to be retrieved. This will usually require the media to be repaired. The second form of data recovery is when files were purposely or accidently deleted. When this type of data recovery is necessary there is usually no damage to the media and standard software can be used to recover the data. This is the process that most software performs. Very few software programs understand damaged media. Because most software relies on calls and functions from the operating system for its input, it has no control itself over error correction or any functions that the operating system performs on the drive. I believe there a four phases to any data recovery.
Four Phases of Recovery
In a previous speech here at Defcon 14, I gave the basic inner workings of a hard drive and several ways you can repair it. I am sure that you can get that previous speech on DVD, find it on the web, or on http://www.myharddrivedied.com/ and it will give you a large amount of info that I am not going to discuss here today. Additionally, there is a whitepaper on the CD that includes more data and notes about repairing a hard drive. Since my last speech one of the most common questions I get everyday is “What is that clicking noise? How do I fix it?” This is not a simple problem by any means. So my goal today is to give you more insight into the inner workings your hard drive and explain how this problem occurs and what you might be able to do to fix it. Slide 1208: In this speech we are looking at the platter assembly where the heads are located, through the area of the preamp and the IC Logic Board down to the PCB. This is the area that affects what is causing the clicking noise that you hear. I am now going to explain how each of these things works and walk you through the drive functions. Part of what causes this clicking problem is related to the power on routine functions.
The boot sequence of a drive is as follows:
Slide1389: The drive begins to spin the spindle or as you would see, the platters begin to revolve. When the platters begin to revolve the air flow around the platter creates a force that is called an air bearing. This air bearing will fling off debris on the platters such as any dust particles or metal fragments from the standard operation of the drive. This air bearing also causes the plastic locking arm mechanism to move out of the way as soon as there is enough air flow for the head to float. Without that airflow the arm is locked in place and will not move over the platter. This is a way to protect the platter from the head touching the platter and causing physical damage. The opposite is true during a power down. When power is cut to the drive, during the last revolutions of the motor, it generates enough power to move the head back to it park position. Because of this, as you can imagine, if you get enough power on and power off cycles in a row it is possible for the head to be stuck in the center of the platter and never to be parked correctly causing several types of damage. In certain 80 gig laptop 2.5” inch drives it is common for the head to be stuck to the center of the platter, never having parked and keeping the platters from spinning. In most cases there is very little damage if the drive is opened and manually turned slow enough not to damage the head, and the data can be recovered, obviously never using this drive again. Slide 1483: At this point, if the all has proceeded correctly the air bearing will allow the head to float over the platter allowing it to move freely without scratching the surface of the platter.
Slide 1545: At this point, if the head is reading the Servo Timing info from the platter and relaying it to the circuitry so the controller knows the geographic information for the placement of data. (See previous speech at Defcon 14 for discussion about Voice Coil and stepping motors to understand the servo info).
Slide 1679: At this point the head moves to the System Area (SA) of the platters and reads the content that it requires as well as any additional firmware and overlays. Most of the time, the system area is on the outer tracks – the extreme outer edge. This is chosen by the manufacturer but is most common on the outside on 3.5 and is sometimes written to the inside tracks on a 2.5” inch drive.System Area Information Common Names
1. System Area Slide 1781: What is in the System Area Info. Each category is called a Module and is a UBA block.
1. Smart Data
Slide 1816: The System Area is made of UBA Modules (Utility Block Addressing) which are sector blocks logically grouped together that contain a specific MODULE. Each UBA block might be different per a drive manufacturer. The UBA # might be Smart Data on one drive and a different type of data on another drive. The UBA area is inaccessible over the standard interface. Most of the commands to talk to the UBA modules are vendor specific and which is generally not made publically available. There are certain pieces of hardware that can be used to communicate with this area such as the PC3000.
For example: In the UBA 1 Area it could be a Bad Block List. As larger drives have been created there has been a need for larger bad block areas. So this might be expanded from two sectors in a previous drive to three sectors in a newer drive. But the firmware for the drive can still refer to each of them as UBA 1 and does not have to have any changes made to the code in the firmware regardless of the size change. Slide 2221: When the drive is manufactured it is known that there is going to be errors in every drive. Drives use ECC to correct most errors and if ECC can correct the error then the sector is never marked as bad. If it is marked as bad, the drive puts the data in a bad block list. Most people know that their hard drive has a bad block table. What most people do not know is that their drive has TWO bad block tables. 1. P-List (Primary Defects List – manufacture defect info that does not change)
The G-List is where the bad blocks that your drive has on a daily basis are stored. Since the P-List is done at manufacturing time that list is never suppose to change. There is a very important reason to know about both lists in a low level recovery which I will explain when we get to the repair section. There are certain utilities that can read, delete, merge and change this data.
Slide 2422: The cylinder structure is extremely important because there are people believe they can just take the platters out and move them to a new drive. This is true you can do this, but you have to move all platters simultaneously.The reason is because data is written in a cylinder. Most people have heard the term cylinder in reference to their hard drive, but they have no idea what that means. Writing in a cylinder means that data is written in parallel due to the fact the heads are always moving together in the same stack. To make it more efficient data is written on the top of a platter and the bottom of the platter and the next platters and so on, at the same time. Your data is NOT written on the top of one platter and when that gets full then written to the next platter. It is written across all the platters at the same time, making a cylinder of your data.
Most data recovery software will scan an entire hard drive and then display a list of files and directory trees you can recover from. However, if there is a lot of damage to the drive, the scanning may never finish or it might die/kill it in the process. If you have smart software and you can figure out where your partitions start and where the MFT or FAT tables might be, you stand a better chance of getting the data you are looking for. If a standard utility was used to create the partition then the partition structure will begin on a cylinder boundary. Again, your partition will begin on a Cylinder Boundary. Software like Byteback (www.byteback.org) RecoverSoft Media Tools Pro (www.recoversoft.com), and Runtimes Disk Explorer (www.runtime.org) are smart enough to know the data exists on the cylinder boundary and will quickly check without you have to scan the whole hard drive and possibly saving your drive from disaster during the scan. There certainly are times that scanning will be required but it is best if you can avoid it except in an imaging process.
Slide 2585: The MR (magnetoresistive) head of the hard drive you can think of as the head of the 90’s. If you remember how reliable the drives were before 2000 it is mostly because of this head and the density of the platters. This head was used on drives mostly before we crossed the 10-20 gig barriers. The MR head could determine if a bit passed under it. When data passed parallel to the head, the head could detect the “MR Effect” due to movement of electrons causing the magnetic field to rotate positive and negative values.
Slide 2840: The GMR (giant magnetoresistive) head is the current head used on most hard drives. This head uses high end physics I do not claim to understand. The only major difference is the way the head has been changed to read perpendicular. The GMR head has four layers, a sensing layer, a conducting layer, a pinned layer and an exchange layer. It was discovered that if you took two magnetic layers and aligned them opposite each other with a soft layer between them that the magnetic force would align themselves in parallel. When a bit of data passes under the heads the electrons bounce around in the layers causing the pinned layer to spin.
For more info, read http://www.hitachigst.com/hdd/technolo/gmr/gmr.htm Slide 2865: Hard drives have switched to Perpendicular Recording. I talked about the changes and previous versions last year and you can reference that speech for more info. The biggest change switching to perpendicular is that the data is written up and down instead of longitudinal. Because of this, changes had to be made to the platter so it would not interfere with reading and writing.
Slide 2885: The coatings have changed and the substrate on the bottom (the platter itself) was the biggest change. Almost every platter has converted to a glass ceramic platter. What this means to you in data recovery is that it is obvious when a scratch occurs. In most cases you will be able to see though the platter. Sometimes the rings that are created by the scratch are so smooth that they look like they are supposed to be there. I assure you that they are not. It should be silver from one edge to the other with no rings at all. So if you see a ring, in most cases the game is over or your recovery just got a lot harder.
Slide 3000: The data structure that is written to the sectors is important to understand if you are using any diagnostic software. Many of them use common nomenclature to discuss the types of errors.
You will see the error codes here in almost all data recovery and diagnostic software. This particular block of data (slide 3259) is one single sector. It contains a 512 byte block of data. This is how on sector looks to every hard drive regardless of your operating system. I could not possibly explain every error you will see, but I can give you the basics of the most common you will see doing diagnostics. · IDNF is the Address not found. If the sector that holds this information is corrupt there is no way for the hard drive to locate this sector and it will return the result IDNF.
Slide 3559: The preamp is a chip that amplifies the signal coming from the heads of the drive. Since the data that is read coming from the heads is similar to a wave form from a speaker, the preamp will amplify it and send it on to the electronics for decoding. There are two types of preamps, one is soldered on, and the second is glued on. It is often possible for a preamp to come loose due to heat expansion and not to have a good connection to the board. It is also possible for the preamp to fail. This is one of the causes of the click of death for the hard drive. It is often difficult to replace or fix this circuit and is more likely you can do a platter swap to a good drive, or replace the head stack assembly. The voice coil was mentioned in previous information at Defcon 14.
Slide 3791: The cause of the click is from four possible areas, all resulting in the SA not being able to be read.
1. System Area of the drive cannot be read because the platter is scratched.
2. The head itself has a problem and cannot read the SA area.
3. Preamp on Actuator to the Head has gone bad and is not passing the correct signal to the electronics
4. The firmware on the board is damaged and does not initialize. This is sometimes caused by static electricity walking across the carpet to install the drives, or there is a short on the board, and additionally I see where someone has allowed the board on the bottom of the drive to touch metal cause it to burn. All will result in the same problem and will sound like the Click of Death. Recovery Software will not help you correct any of these until after you have repaired the drive and it is running again. Correcting Problems Now we move on to some of the things you can do about it on your own. The click of death is a very difficult problem to solve and in some cases will not be able to be solved especially without some very high end and expensive equipment. But I will tell you what I have been able to fix without that equipment. Slide 4009: Swapping the PCB (printed circuit board) Live to get around a SA area that cannot be read.
I have done this process several times successfully. It is not perfect but it is a possible chance you will have to recover your data. The first step is to get a hard drive as close to identical as the bad drive you have that is a working drive. At the bottom of this paper you will find help about matching hard drives and serial numbers. If the System Area is badly damaged or corrupt and for some reason the drive will not read the System Area you can attempt to do a live swap. What this means is that you can hook up the good drive, then you use software or windows and tell the drive to go to sleep. This will cause the drive to spin down but will still be live and powered up and mounted. Once the drive goes to sleep and the drive stops spinning you can unscrew the board, carefully so as not to let the screws roll around on the board, and disconnect the board and connect it to the bad drive. I suggest that once you do this, you go after the files you need very quickly. It’s possibly you will be able to make an image of the drive. Keep in mind, that whatever bad blocks that the drive had assigned to the other drive will be bad here as well. You could try to use some software to clear bad blocks before attempting this, however I don’t suggest it in most cases. That is because it is one more possible item that might cause failure. I would prefer to use the drive that was working and lose a few blocks. After you get what you can then you can attempt to make changes and go back for more data. This is a concept that works about 25% of the time.
Slide 4199: Imaging in Reverse
In dealing with damaged hard drives, I have run into many problems with cache memory on the drive. The problems will often show up as timeouts or ECC failures as well. For example, I try to read from a drive with16 megs of ram for cache and receive errors but the drive is otherwise appears ok. If there is an error 16 megs away from the sector I am reading my drive will die. As of now there is no way to turn off this cache. However, if you can image your drive backwards there is no cache. Memory on a drive only caches data forward. There are only three ways I know of to image a drive backwards. The first is free, and it is to use dd_rescue. dd_rescue has a special setting for imaging a drive backwards. There is also a special script for dd_rhelp to control dd_rescue for the purpose of data recovery. You can use this on Linux and it works on drives regardless of the operating system on the drive you are recovering from. Typically you will start at the MaxLBA number and work backwards down to 0 LBA. It works quite well and will work on a surprising number of drives that cannot be read any other way. Your other two choices are Media Tools Pro from RecoverSoft (http://www.recoversoft.com/) for Windows, which is about $400, or a piece of hardware which is extremely efficient at doing this type of recovery called Deepspar Disk Imager (http://www.deepspar.com/products-ds-disk-imager.html), which will cost between $3000 and $4000 depending on configuration. But you should contact each of these vendors for pricing, or use the free option! Slide 4259: Head Replacement SectionThis is the only section from last year I kept and it is because this is directly related to fixing this click of death problem. NOTE: If there is only one platter it might be easier to move the platter than to move the assembly. You have to make that choice. List of items needed:
Process for Head Replacement: 1. You will need to disassemble the heads and other components from the drive to clear the room for the head and components.
2. Disassemble the new hard drive, and carefully use folded paper to move the heads apart and to keep them apart as much as possible.
3. Carefully lift the assembly out of the drive and move it to the bad drive and reassemble. It will take about two hours to assemble correctly if you take your time. Do everything you can to get the heads lined up again.
You must get them lined up and review it before you turn the drive back on or the heads may slide into place and hit the edge of the platter ripping them off and scratching the platter. It is good to practice with another drive you do not care about before doing this. Slide 4500: Doing a Platter Swap for a Single PlatterList of items needed:
Be very careful to keep the orientation in the same direction to so that the platter will be in the correct location when you put the platter back on the new drive. Slide TBD: Doing a Platter Swap for a Multi-Platter In order to do a Multi-Platter replacement you will need a special tool. If you have more than one platter and you take out the platters and any one of them turns at all, you will never get them aligned again or be able to read the data. This is because the data is written in a cylinder. Since the data is in a cylinder you must have the exact same alignment of the platters in order to move them to a new hard drive.
There is a special tool called a Platter Replacement Stand. You can get one at SalvationData.com http://serenity.nicservices.com/exchweb/bin/redir.asp?URL=http://www.salvationdata.com/productDetail.asp?pn=00013 for around $250 plus postage. It is a really heavy stand and weighs about 10 pounds. The platter replacement tool is what you really need and it looks a lot like a coffee can with a slit in the side. Once you have moved your heads out of the way, this can sits down around all the platters and you can push down on a piece of metal mounted in the slit to tighten it around the platters. It also has a lid inside that sits on the top ring of the platters that will hold the screws and keep them from rolling around all over the platters.
The pressure from the “coffee can” will hold all the platters together; however you still have to be really careful about taking it out and turning it. It should go straight from one hard drive to the other as quickly as possible with as little movement as possible.
This is the best possible way to keep the drive platters lined up.
You will still reassemble the drive just like you do in a head stack replacement or a single platter replacement. The only difference is using this device to move the platters.
The plate inside the tool holds the screws so that they do not scratch the platter.
Realign the heads.
Slide 4945: The End with a Crashing Hard Drive Shattering into Parts
Additional Research Information and Notes Matching Serial Numbers on Hard Drives This link is where I keep track of documentation on how each hard drive needs to be matched for a working donor drive. I get this any where I can, use it if you can, and if you happen to find something out please let me know so I can add it to the collection! NOTES: REFIRBUSHED DRIVES QUANTUM SEAGATE FUJITSU IBM and HITACHI DRIVES - Usually the same drive IBM MLC codes have to match HITACHI WESTERN DIGITAL DRIVES SAMSUNG MAXTOR DRIVES The boot sector in the FAT32 partition
The boot sector in the FAT partition The data contained in the boot sector after the OEM name string is referred to as the BIOS parameter block or BPB
Random Notes and Ideas For Data Recovery 1. Drive goes to sleep, replace the board live
by Scott A. Moulton @ Forensic Strategy Services, LLC. Copyright © 2007, All rights reserved.
|
Your are currently browsing this site with Internet Explorer 6 (IE6).
Your current web browser must be updated to version 7 of Internet Explorer (IE7) to take advantage of all of template's capabilities.
Why should I upgrade to Internet Explorer 7? Microsoft has redesigned Internet Explorer from the ground up, with better security, new capabilities, and a whole new interface. Many changes resulted from the feedback of millions of users who tested prerelease versions of the new browser. The most compelling reason to upgrade is the improved security. The Internet of today is not the Internet of five years ago. There are dangers that simply didn't exist back in 2001, when Internet Explorer 6 was released to the world. Internet Explorer 7 makes surfing the web fundamentally safer by offering greater protection against viruses, spyware, and other online risks.Get free downloads for Internet Explorer 7, including recommended updates as they become available. To download Internet Explorer 7 in the language of your choice, please visit the Internet Explorer 7 worldwide page.








































