OS X Lion on 2GB RAM is painful

Since the RAM in my Mac Mini went fubar, all I can say is that running Lion on 2GB of RAM is painful.

Can’t wait for the replacement RAM I ordered to arrive, as being back on the original factory memory will kill me. It feels like the machine is constantly swapping, which it probably is given the sort of work I need to do on it from time to time.

Simpana 9 – Check Readiness on Windows, Linux and Macintosh passes although backups fail

Came across an interesting thing today and thought I would publish it, as it could certainly catch people out.

You’ll find the Client Readiness Check passes on Windows, Linux and Macintosh clients (those I have checked so far), although your backup fails against those hosts at the 5% mark with an error similar to the one below.

Error Code: [19:599] 
Description: Loss of control process ifind.exe. 
Possible causes: 
1. The control process has unexpectedly died. Check Dr Watson log or core file.
2. The communication to the control process machine wentx862k8-1 might have gone down due to network errors. 
3. If the machine wentx862k8-1 is a cluster, it may have failed over. 
4. The machine wentx862k8-1 may have rebooted.

And yet the readiness check against the client will still pass.

It appears the readiness check doesn’t actually check all the services/processes on the client that are required to perform the backup. In this situation I killed the EvMgrC process, which is certainly required; it’s as important as the cvd process for the client to function and be backed up. Without it the backup will fail, yet the Readiness Check still passes for the client, as it looks like EvMgrC is not actually checked.

Of course it’s rare for EvMgrC to be unavailable like this, but it certainly could happen, and it causes confusion when the client readiness check is performed and clearly says everything is okay.

On Windows clients be sure to check that the essential services are running, and on Unix platforms run the following command to ensure the key items have an associated PID.

# cd /opt/simpana/Base
# ./simpana list
+---------------------------------+---------+----------------------------------+
| Service name                    |   PID   | Service command                  |
+---------------------------------+---------+----------------------------------+
| cvlaunchd                       | 4020    | /opt/simpana/Base/cvlaunchd      |
+---------------------------------+---------+----------------------------------+
| cvd                             | 4091    | /opt/simpana/Base/cvd            |
+---------------------------------+---------+----------------------------------+
| EvMgrC                          | 4085    | /opt/simpana/Base/EvMgrC         |
+---------------------------------+---------+----------------------------------+
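
If you want to script that check, here is a minimal sketch that reads the table above from standard input and prints any of the three services that has no numeric PID. The function name missing_services is made up for illustration, and how a stopped service actually appears in the listing (blank PID, some other text, or no row at all) is an assumption, so treat this as a starting point only.

```shell
#!/bin/sh
# Sketch: print each required service that has no numeric PID in the
# "simpana list" table read from standard input.
# Assumption: a stopped service shows a blank/non-numeric PID or no row.
missing_services() {
    awk -F'|' -v required="cvlaunchd cvd EvMgrC" '
        NF >= 4 {
            # Data rows look like: | name | pid | command |
            name = $2; pid = $3
            gsub(/[ \t]/, "", name)
            gsub(/[ \t]/, "", pid)
            if (pid ~ /^[0-9]+$/) running[name] = 1
        }
        END {
            n = split(required, want, " ")
            for (i = 1; i <= n; i++)
                if (!(want[i] in running)) print want[i]
        }'
}

# Usage on the client (no output means all three have a PID):
#   /opt/simpana/Base/simpana list | missing_services
```

The header and border rows are skipped naturally, since neither carries a numeric value in the PID column.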

Hope this helps anyone on the internet who might come across this. This type of failure can also be seen in the FileScan.log on the client, where the error message below is a good indication that the issue is as described.

3044 be8 11/15 19:28:01 ### EvSocket::doConnect() - Could not connect to wentx862k8-1(wentx862k8-1):EvMgrC: Connect to 127.0.0.1:8402 failed: Connection refused
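
A quick way to look for that message is a grep over the log. The sketch below counts matching lines; the function name is made up for illustration, and the pattern is taken only from the single log line above, so it may need loosening for other versions.

```shell
#!/bin/sh
# Sketch: count log lines where the local connection to EvMgrC was refused.
# Reads the file given as $1, or standard input when no file is given.
count_evmgrc_refused() {
    pattern='EvMgrC: Connect to .* failed: Connection refused'
    if [ "$#" -gt 0 ]; then
        grep -c "$pattern" "$1"
    else
        grep -c "$pattern"
    fi
}

# Usage (the log location depends on the install, so it is a placeholder here):
#   count_evmgrc_refused /path/to/FileScan.log
```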

It appears that Windows clients recover the EvMgrC process after a period of time, so the problem should correct itself there; however, I have not seen that same behaviour on Linux and/or Macintosh clients as yet. Will keep investigating whether they recover it too.

Mac Mini 4,1 Kernel Panics

Been having some very odd issues on my Mac Mini 4,1. It looked like the mouse had started to play up, and oddly enough new batteries made no difference. As soon as I removed the RAM and put the factory memory back, it all came good. It’s safe to say the 2 x 2GB modules I got have some issues. Think I will bin those, as they had no warranty.

It looks like Other World Computing (macsales.com) has some great prices. Might order some replacements from them.

CommVault Simpana Linux client can backup but not restore

Had an interesting issue with a Linux client running CommVault Simpana where it could back up fine, but any attempt to restore would fail. The problem is that the reported error indicated something network/comms related, although that wasn’t the case.

The logs on the client pointed towards kernel parameters, and upon further investigation it was indeed kernel parameters. The errors in the logs are shown below for reference:

ClRestore.log
24346 407db90  11/09 12:20:28 105 [PIPELAYER  ]  Pipeline not Created Yet or is missing. retrying...
24346 407db90  11/09 12:20:46 105 [PIPELAYER  ] ERROR: Error: Received Message type=7 on sd=13
24346 407db90  11/09 12:20:46 105 CPipelayer::InitiatePipeline() - Error initiating pipeline!  plInitiatePipeline returned -1
24346 407db90  11/09 12:20:47 105 CCVAPipelayer::StartPipeline() - Failed to initiate pipeline
24346 407db90  11/09 12:20:47 105 CVArchive::StartPipeline() - Startup of DataPipe failed

And…

Cvd.log
12462 b7efd6d0 11/09 10:28:15 ### [CVD        ] IPCKEYS Path=//opt/hds/Base/Temp/1320798495_5482_55344016, curr=1, dest=2, topid=0
12462 b7efd6d0 11/09 10:28:15 ### [CVD        ] IPCKEYS key[0]=0x540e0055, key[1]=0x0a0e0055, key[2]=0x0b0e0055, key[3]=0x090e0055, ReaderKey=0x3b0e0055
12463 b7f4e6d0 11/09 10:28:15 ### [CVD        ] IPCKEYS Path=//opt/hds/Base/Temp/1320798495_5482_55344016, curr=2, dest=3, topid=0
12462 b7efd6d0 11/09 10:28:15 ### [CVD        ] ERROR: initIpc: shmget() err 22, flag 950, size 2007080
12463 b7f4e6d0 11/09 10:28:15 ### [CVD        ] IPCKEYS key[0]=0x540e0055, key[1]=0x0b0e0055, key[2]=0x0c0e0055, key[3]=0x090e0055, ReaderKey=0x3b0e0055
12462 b7efd6d0 11/09 10:28:15 ### [CVD        ] ERROR: plInitIpc: initIpc() fail err 22
12463 b7f4e6d0 11/09 10:28:15 ### [CVD        ] ERROR: initIpc: shmget() err 22, flag 950, size 2007080
12463 b7f4e6d0 11/09 10:28:15 ### [CVD        ] ERROR: plInitIpc: initIpc() fail err 22

I took another look at the values set for some key kernel.* parameters:

kernel.sem = 500	64000	64	256
kernel.msgmnb = 65536
kernel.msgmni = 16
kernel.msgmax = 65536
kernel.shmmni = 8192
kernel.shmall = 0
kernel.shmmax = 0

Of course those last two caused me concern. Once they were modified, the client could restore.
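
For reference, errno 22 on Linux is EINVAL, which shmget() returns when a new segment is requested with a size greater than SHMMAX (or less than SHMMIN). With kernel.shmmax at 0, the 2007080 byte request in Cvd.log can never fit. A rough sketch of that comparison, where shm_request_fits is just an illustrative helper name:

```shell
#!/bin/sh
# Sketch: succeed if a shared memory request of $1 bytes fits under a
# shmmax limit of $2 bytes. This mirrors only the SHMMAX part of the
# shmget() size check, nothing else.
shm_request_fits() {
    [ "$1" -le "$2" ]
}

# Size taken from the Cvd.log entry; limit as reported on the broken host.
if shm_request_fits 2007080 0; then
    echo "request fits"
else
    echo "request exceeds shmmax, so shmget() fails with EINVAL (errno 22)"
fi

# On a live Linux host the current limit can be read with:
#   cat /proc/sys/kernel/shmmax
```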

Turns out the host in question, although a 32-bit Linux host with a PAE kernel, had somehow ended up with a sysctl.conf containing the values for these two kernel parameters from the 64-bit initscripts package. Those values are too large for a 32-bit kernel, so when applied by sysctl -p at boot they wrap to 0. Hence the output seen above.

From the 64-bit sysctl.conf file:

..
# Controls the maximum shared segment size, in bytes
kernel.shmmax = 68719476736

# Controls the maximum number of shared memory segments, in pages
kernel.shmall = 4294967296

From the 32-bit sysctl.conf file:

..
# Controls the maximum shared segment size, in bytes
kernel.shmmax = 4294967295

# Controls the maximum number of shared memory segments, in pages
kernel.shmall = 268435456
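
The wrap to 0 is consistent with the values being truncated to 32 bits: 68719476736 is 2^36 and 4294967296 is 2^32, so both leave a remainder of 0 when reduced modulo 2^32. A small sketch to check this, assuming a shell with 64-bit arithmetic (wrap32 is an illustrative name, and modulo 2^32 is my reading of the wrap, not something taken from the kernel source):

```shell
#!/bin/sh
# Sketch: reduce a value modulo 2^32, i.e. what is left when it is
# squeezed into a 32-bit unsigned integer.
wrap32() {
    echo $(( $1 % 4294967296 ))
}

wrap32 68719476736   # kernel.shmmax from the 64-bit file -> 0
wrap32 4294967296    # kernel.shmall from the 64-bit file -> 0
wrap32 4294967295    # kernel.shmmax from the 32-bit file -> 4294967295
wrap32 268435456     # kernel.shmall from the 32-bit file -> 268435456
```

The 32-bit values pass through unchanged, which is why the stock 32-bit file does not show the problem.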

Anyway, problem solved. It took a fair bit of research and testing to find the culprit on that one. Don’t ask me how the host got the 64-bit sysctl.conf; I can’t see a bug filed against initscripts for RHEL 5.7 for this, so I can only assume it was introduced by a scripted change and/or manually.

Just another hot and humid day.

Looks like summer might be a hot one if the weather today is anything to go by. It’s quite overcast at the moment, and the heat is just hanging about.

Had a nice BBQ earlier today with family and I ate so much I don’t think I will need to eat much for dinner.

Ended up using the spa before too, it was a nice end to the day.

Novell Certified Linux Administrator (CLA)

Thanks to Novell, who have a partnership with LPI and offer LPIC-1 as an education path to the Novell Certified Linux Administrator (CLA) certification, I now have this certification too as of passing LPIC-1.

So it’s been a good week. Will be preparing for LPIC-2 now. The plan was to tackle those exams in the new year.

Solaris Training

It looks like Solaris 11 training updates are just around the corner (I noticed Oracle mention beta exams starting in 2012), so I am guessing if anyone was considering Solaris training, it would be wise to wait. Is this the general view of others?

Be interested to hear from you.

The reason I ask is that my employer wants to get me more involved with Solaris. I also noticed the Solaris certifications changed recently to require a course component for Solaris 10, so I imagine this will carry on for Solaris 11.

 

LPIC-1 Certification complete

Today I passed my LPIC-1 Exam 102, so I now have completed my LPIC-1 certification. I passed LPIC-1 Exam 101 last month.

Next goal is LPIC-2, which is two more exams. That was planned for next year, but I might start the reading and prep now, and maybe I can slip in one more exam before year’s end. I won’t stress about it though, as I am waiting on BF3 to show up by post so I can start playing as a reward.

It’s good to celebrate.