Re: Going from mod-3 to mod9

Bill Fairchild Mon, 30 Jan 2012 08:56:29 -0800

Thanks, Ron.  I didn't know about the 3990-3's ability to process two cache 
hits simultaneously.  I would assume, though, that both I/Os would have to be 
read only.  And all that the controller has to do to test for read-only is to 
test two bits in the File Mask byte.


The pending time I described existed with the introduction of S/370-X/A in the 
early 1980s for all attached DASDs, I believe.

The single threading with MDA is necessary if there is any possibility that one 
I/O can alter data that another active I/O to the same logical device might 
need to read.  This is controlled in EMC's 2105 microcode by single-threading 
all I/Os to device X that have at least one track in common in their defined 
extent track ranges AND if any of the I/Os with the extent overlap is not 
read-only.

As long as all the I/Os to device X are read-only, then an infinite number of 
them can run simultaneously with no worries.  But as soon as an I/O is found to 
be non-read-only, then its track range is used for single-threading against all 
other I/Os running that have at least one track in both I/O's track ranges.

E.g., if I/O (1) from system (A) to device X has a track range of 
00000000-5678000E and is read-only but I/O (2) from system (B) to device X has 
a track range of 5678000E-56790000 and also has the potential to write data 
somewhere within that range of only two tracks, then both of these I/Os must be 
single-threaded and executed in the same sequence in which they entered the 
control unit.  At least this is how EMC's patent application describes how 
their equivalent MDA microcode works.  And this single-threading is not forced 
upon the I/Os by sibling pend but rather is required in order to guarantee data 
integrity regardless of how many siblings or internal paths there are.

Bill Fairchild

-----Original Message-----
From: IBM Mainframe Discussion List [mailto:IBM-MAIN@bama.ua.edu] On Behalf Of 
Ron Hawkins
Sent: Friday, January 27, 2012 9:17 PM
To: IBM-MAIN@bama.ua.edu
Subject: Re: Going from mod-3 to mod9

Bill,

You said " If the shared control unit attached to this device is not an IBM
2105 SHARK (vintage ca. 2000), plug-compatible equivalent, or some successor 
technology, then image A's I/O will not really be started until image B's 
already started I/O ends."

Logical Device allegiance was introduced in the 3990-3 would allow ImageA and 
ImageB to process two cache hits concurrently providing they were not on the 
same track. My understanding is that the pending time you described occurs when 
the both systems tried to access the disk drives in 3990 controller emulation.

Multiple Device Allegiance (MDA) was developed to allow concurrent access to 
multiple disk drives that make up the logical volume. There's no need single 
threading IO requests from multiple systems when there's four or more disk 
drives that can handle the IO without sibling pend.

HDS had a function called the multi-job IO feature that would allow up to four 
LPARs to access the HDD for a logical volume concurrently. This was introduced 
in the 7700E, but they had to rewrite it in order to support MDA and PAV (9980V 
I think). MDA works without a PAV license.

Ron

> -----Original Message-----
> From: IBM Mainframe Discussion List [mailto:IBM-MAIN@bama.ua.edu] On
Behalf Of
> Bill Fairchild
> Sent: Monday, January 23, 2012 11:21 AM
> To: IBM-MAIN@bama.ua.edu
> Subject: Re: [IBM-MAIN] Going from mod-3 to mod9
> 
> In <45e5f2f45d7878458ee5ca679697335502e25...@usdaexch01.kbm1.loc>, on
> 01/23/2012
>    at 09:08 AM, "Staller, Allan" <allan.stal...@kbmg.com> said:
> 
> >From the viewpoint of the Operating System, you now have 3 times as 
> >much data behind the actuator on Mod-9's as Mod-3's.
> >If the Operating system *thinks* the device is busy, the IO is queued 
> >off the UCB and never even tried until it comes to head of queue.
> 
> You MIGHT have up to three times as much data behind the actuator.  
> That depends on how fully loaded the three mod-3s are which are to be 
> merged
onto
> the same single mod-9; i.e.,  it depends on which three mod-3s you 
> choose
to
> merge together.
> 
> If all data sets on all volumes are equally and randomly accessed, 
> then
you
> will have three times as much requirement to access the new mod-9 as 
> any
of
> the three mod-3s had which were merged.  However, most data centers 
> have highly skewed access patterns.  80% of the actuators might have 
> only 20%
of
> the total I/O workload.  Which means your volumes are almost certainly 
> NOT equally and randomly accessed.  You have some volumes that are 
> almost
never
> accessed and some others that are accessed all the time.
> 
> When z/OS starts in I/O on DASD device xxxx, z/OS turns on a flag bit 
> in
the
> UCB for that device that indicates that this particular z/OS image has
started
> an I/O on that device.  But if the device is shared, then another z/OS
image
> may have already started an I/O on the same device, turned that same
device's
> UCB flag bit on in its copy of the UCB for the device (which might be
device
> yyyy on the other image), and not informed any of the other sharing 
> z/OS images that it is now doing I/O on that shared device.  So when 
> image A
tests
> its private copy of the flag bit and finds it off, that does not
necessarily
> mean that the device is unbusy.  Image A doesn't care, however.  It 
> starts
the
> I/O and turns the bit on.  If the shared control unit attached to this
device
> is not an IBM 2105 SHARK (vintage ca. 2000), plug-compatible 
> equivalent,
or
> some successor technology, then image A's I/O will not really be 
> started
until
> image B's already started I/O ends.  This will show up on ima!
>  ge A as a spike in device pending time, not in IOSQ time.  The 2105 
> and
newer
> technology have the ability to let multiple I/O requests from multiple
sharing
> systems run simultaneously against the same device as long as there is 
> no conflict between any of the simultaneous I/Os involving both reads 
> and
writes
> for the same range of tracks.
> 
> The only way to know what will probably happen is to do I/O 
> measurement on your current mod-3 workload.  If you don't see much 
> IOSQ time now, then
you
> will see "not much" multiplied by three after merging.  How much is 
> not
much
> and/or is negligible is up to you to decide.  You might also get an 
> idea
as to
> how to merge volumes together based on their individual IOSQ times; 
> e.g., merge the one with the highest IOSQ time now with the two mod-3s 
> that now
have
> the lowest average IOSQ times.  After merging them, measure again for 
> IOSQ time.  Only if you have "excessive" IOSQ time, where how much is 
> excessive
is
> up to you to decide, would you need to consider using PAV devices.
> 
> Currently z/OS's I/O Supervisor has no knowledge of the real RAID
architecture
> backing the virtual SLED, so many of the classic performance- and 
> space- related bottlenecks can theoretically still occur.
> 
> Bill Fairchild
> 
> ----------------------------------------------------------------------
> For IBM-MAIN subscribe / signoff / archive access instructions, send 
> email
to
> lists...@bama.ua.edu with the message: INFO IBM-MAIN

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions, send email to 
lists...@bama.ua.edu with the message: INFO IBM-MAIN

----------------------------------------------------------------------
For IBM-MAIN subscribe / signoff / archive access instructions,
send email to lists...@bama.ua.edu with the message: INFO IBM-MAIN

Re: Going from mod-3 to mod9

Reply via email to