OOS Chapter 8
Download
Report
Transcript OOS Chapter 8
TROUBLESHOOT LINUX SYSTEM
CHAPTER 8
8.1 Troubleshoot methodology
The maintenance cycle
Monitoring: Observing system areas for problems or
irregularities
Proactive maintenance: Minimizing chance of future
problems
e.g., perform regular system backups
Reactive maintenance: Correcting problems when they
arise
Documenting solutions
Developing better proactive maintenance methods
Documentation: System information stored in a log book for
future references
Troubleshooting procedures: Tasks performed when solving
system problems
Common troubleshooting procedures
Two troubleshooting golden rules:
Prioritize problems according to severity
Spend reasonable amount of time on each problem given its priority
Try to solve root of problem
Avoid missing underlying cause
Justify why a certain solution is successful
Two categories of problems:
Hardware-related
Software-related
8.2 Hardware-Related Problems
Hardware-Related Problems
Often involve improper hardware or software configuration
SCSI termination
Video card and monitor configuration
POST test alerts
Loose hardware connections
IRQ or I/O address conflicts
View output of dmesg (control or print the kernel ring buffer) command
Absence of device drivers prevent OS from using associated
devices
Kudzu program: Detect and install support for new hardware
If hardware device not detected, device driver must be
configured manually
HDDs most common device to fail
Good idea to use RAID
The kudzu welcome screen
Configuring new hardware using kudzu
• If HDD containing partitions mounted on
noncritical directories fails:
– Power down computer and replace failed
HDD
– Boot Linux system
– Use fdisk to create partitions on replaced
HDD
– Use mkfs to create filesystems
– Restore original data
– Ensure /etc/fstab has appropriate entries to
mount filesystems
• If HDD containing / filesystem fails:
– Power down computer and replace failed
HDD
– Reinstall Linux on new HDD
– Restore original configuration and data files
• You should update your package every
time you made changes to your system
(hardware/software).
• You can run PUP, yum, apt-get, or GUI
based synaptic package manager to do
the update.
8.3 Software-Related Problems:
Application-Related Problems
Missing program libraries/files, process restrictions, or
conflicting applications
Dependencies: Prerequisite shared libraries or packages required
for program execution
Programs usually check at installation
Package files may be removed accidentally
• rpm –V command: Identify missing files in
a package or package dependency
• ldd command: Display shared libraries
used by a program
• ldconfig command: Updates
/etc/ld.so.conf and /etc/ld.so.cache files
• /etc/ld.so.conf file: List of directories
containing shared libraries
• /etc/ld.so.cache file: Contains location of
shared library files
• compressor/decompressor (codec) file:
Contains rules to compress or
decompress multimedia information
ulimit command: Modify process limit parameters in current
shell
Can also modify max number of filehandles
/var/log directory: Contains most system log files
If applications stop functioning due to difficulty gaining
resources, restart using SIGHUP / SIGHKILL
8.4 Software-Related Problems:
Operating System-Related Problems
Most software-related problems related to OS
Boot loader, filesystem, serial device problems
LILO problems: Place “linear” in, remove “compact”
from /etc/lilo.conf file
GRUB problems: Typically result of missing files in /boot
directory
mkbootdisk command: Create a boot floppy diskette
If filesystem on partition mounted to noncritical directory
becomes corrupted:
Unmount filesystem
Run fsck command with –f (full) option
If fsck command cannot repair filesystem, use mkfs command
to re-create the filesystem
Restore filesystem’s original data
If / filesystem is corrupted:
Boot from first Red Hat Fedora installation CD
Type “linux rescue” at welcome screen
Enter shell for Linux system on CD
Create new / filesystem via mkfs command
Restore original data to re-created / filesystem
Reboot system
Lost root password
First, you have to reboot into recovery mode.
If you have a single-boot (Ubuntu is the only operating
system on your computer), you may have to press the
Escape key during bootup in order to see the boot menu.
If you have a dual-boot (Ubuntu is installed next to
Windows, another Linux operating system, or Mac OS
X; and you choose at boot time which operating system
to boot into), the boot menu should appear without the
need to press the Escape key.
From the boot menu, select recovery mode, which is
usually the second boot option.
After you select recovery mode and wait for all the boot-up
processes to finish, you'll be presented with a few options. In
this case, you want the Drop to root shell prompt option so
press the Down arrow to get to that option, and then press
Enter to select it.
The root account is the ultimate administrator and can do
anything to the Ubuntu installation (including erase it), so please
be careful with what commands you enter in the root terminal.
Once you're at the root shell prompt, if you have forgotten your
username as well, type
ls /home
You should then see a list of the users on your Ubuntu
installation.
To reset the password, type passwd username
username is the username you want to reset.
You'll then be prompted for a new password. When you type the
password you will get no visual response acknowledging your
typing.Your password is still being accepted.
Just type the password and hit Enter when you're done.You'll be
prompted to retype the password. Do so and hit Enter again.
Now the password should be reset. Type
to the recovery menu.
exit
to return
After you get back to the recovery menu, select resume
normal boot, and use Ubuntu as you normally would—only
this time, you actually know the password!