Hi,
       Your guess was right ,thanks a ton again..yes
when i ran installation with kernel heap checking
enabled ,ran into a panic reporting 
'redzone violation: write past end of buffer' ,and
this was for the 'scsi_pkt' structure during
scsi_hba_pkt_free() 

Oddly (or maybe not considering 32/64-bit issues) this
problem does not happen with the normal 64bit driver
,only with 32 bit since thats what the installation
kernel is also running 

In order to debug exactly where the error might be
happening i tried to insert the same few lines as in
kmem_free( which is called during scsi_hba_pkt_free) 
at different points in my code where scsi_pkt is being
accessed

if (((uint8_t *)buf)[size] != KMEM_REDZONE_BYTE) {
 /// LOG ERROR or panic here!
} 

where buf-> scsi_pkt structure in my driver
      size =scsi_pkt_wrapper_len 

For some reason i found error right after
scsi_hba_pkt_alloc() was invoked in my
tran_init_pkt(),what does this mean ..how could that
happen ?? Is there something wrong with my
pkt_ha_private structure that is leading to this?


BTW i have installed the same OS on a local IDE hard
disk on the same hardware et al , so im guessing that
rules any h/w issues?

Thanks
Som

 



--- Jürgen Keil <[EMAIL PROTECTED]> wrote:

> Somnath kotur wrote:
> 
> >         OOPS! i forgot the attachment :( ..am
> > attaching it again now ... I'm also copy pasting
> below
> > the first few lines of the install_log and also
> the
> > end of the file 
> > 
> >
>
/*****************************************************/
> > 
> > Installation partially failed
> > Error: 1332 blocks
> > Error:
> > Error: Processing package instance <MYIscsi> from
> > </tmp/19039>
> > Error:
> > Error: My Dummy iSCSI HBA Driver for
> Solaris(i86pc)
> > 1.0.0
> > Error: Using </a> as the package base directory.
> > Error: ## Processing package information.
> > Error: ## Processing system information.
> > Error:    3 package pathnames are already properly
> > installed.
> > Error: ## Verifying disk space requirements.
> > Error:
> > Error: Installing My Dummy iSCSI HBA Driver for
> > Solaris as <BEIscsi>
> > Error:
> > Error: ## Installing part 1 of 1.
> > Error: /a/kernel/drv/amd64/myiscsi
> > Error: /a/kernel/drv/myiscsi
> > Error: [ verifying class <none> ]
> > Error: ## Executing postinstall script.
> > Error:
> > Error: Installation of <MYIscsi> was successful.
> > Error:
> > Error: *** IMPORTANT NOTICE ***
> > Error:         This machine must now be rebooted
> in
> > order to ensure
> > Error:         sane operation.  Execute
> > Error:                shutdown -y -i6 -g0
> > Error:         and wait for the "Console Login:"
> > prompt.
> > 
> >
>
/*****************************************************/
> 
> In the middle of the install_log file, we also find
> this:
> 
> 
>      8742 blocks
>      Segmentation Fault - core dumped
>      pkgadd: ERROR: class action script did not
> complete successfully
> 
>      Installation of <SUNWolrte> partially failed.
> 
> 
> 
> Hmm, some unknown process seems to be segfaulting
> duing the
> install...    Does it always log a "segmentation
> fault" for
> the SUNWolrte package, when you install S10?  That
> is, is
> that segfault reproducable?
> 
> 
> IIRC, some days ago you posted an install log file
> containing
> strange "Operation not applicable" error messages.
> 
> 
> What kind of hardware are you using?  Could this be
> a problem
> due to a bad memory module?
> 
> Or is it possible that your new iSCSI kernel driver
> is causing
> this kind of instability of the OS?  Maybe you
> should try an
> S10 installation with kernel heap checking enabled;
> boot the
> installation kernel with options "-kd", and from
> kmdb set the
> kernel variable "kmem_flags/W 0xf" and start the
> kernel
> with ":c", ...
> 
> See also:
> 
>     
>
http://opensolaris.org/os/project/cn/files/System_hangs_debugging-ext-v1.1.pdf
> 
> 
> 
> > /************ END OF THE FILE
> ***********************/
> > 
> > Installing boot information
> >         - Updating boot environment configuration
> file
> >         - Installing boot blocks (c1t0d0)
> >         - Installing boot blocks
> (/dev/rdsk/c1t0d0s0)
> > svc.configd: Fatal error:
> /a/etc/svc/repository.db:
> > integrity check failed. Details in
> > /etc/svc/volatile/db_errors
> > svccfg: Repository server failed (exit 102).
> > 
> >
> /**************************************************/
> 
> 
> 
> 
> > This definitely points to some failure regarding
> my
> > package installation off the DU ?  
> > Also even if it is ultimately due a 'disk read'
> error
> > ..any pointers on how to exactly debug this ? 
> 



      
____________________________________________________________________________________
Be a better friend, newshound, and 
know-it-all with Yahoo! Mobile.  Try it now.  
http://mobile.yahoo.com/;_ylt=Ahu06i62sR8HDtDypao8Wcj9tAcJ 

_______________________________________________
driver-discuss mailing list
driver-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/driver-discuss

Reply via email to