[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ramkumar Vadali updated MAPREDUCE-2267:
---------------------------------------

    Status: Open  (was: Patch Available)

Will upload another patch

> Parallelize reading of blocks within a stripe
> ---------------------------------------------
>
>                 Key: MAPREDUCE-2267
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2267
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: contrib/raid
>    Affects Versions: 0.22.0
>            Reporter: Ramkumar Vadali
>            Assignee: Ramkumar Vadali
>         Attachments: MAPREDUCE-2267.1.patch, MAPREDUCE-2267.2.patch, MAPREDUCE-2267.3.patch, MAPREDUCE-2267.4.patch, MAPREDUCE-2267.patch
>
>
> RAID code has several instances where several blocks of data have to be read
> to perform an operation. For example, computing a parity block requires
> reading the blocks of the source file. Similarly, generating a fixed block
> requires reading a parity block and the good blocks from the source file.
> These read operations proceed sequentially currently. RAID code should use a
> thread pool to increase the parallelism and thus reduce latency.

--
This message is automatically generated by JIRA.
-
For more information on JIRA, see: http://www.atlassian.com/software/jira
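The thread-pool approach proposed in the issue description can be sketched roughly as follows. This is not the code from the attached patches: the class `ParallelStripeRead`, its methods, and the use of in-memory `Callable<byte[]>` tasks as stand-ins for block reads are all hypothetical, and XOR parity is assumed purely for illustration. The sketch only shows the general idea of submitting every block read of a stripe to an `ExecutorService` and combining the results once all reads finish.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

public class ParallelStripeRead {

    // Runs all block reads of one stripe on the pool and returns the blocks
    // in the same order as the tasks. Each Callable is a hypothetical
    // stand-in for reading one block from the file system.
    static List<byte[]> readStripe(List<Callable<byte[]>> blockReads,
                                   ExecutorService pool) throws Exception {
        // invokeAll blocks until every task has completed.
        List<Future<byte[]>> futures = pool.invokeAll(blockReads);
        List<byte[]> blocks = new ArrayList<>();
        for (Future<byte[]> f : futures) {
            // get() rethrows a failed read wrapped in an ExecutionException.
            blocks.add(f.get());
        }
        return blocks;
    }

    // XOR of equally sized blocks (assumed parity scheme for this sketch).
    static byte[] xorParity(List<byte[]> blocks) {
        byte[] parity = new byte[blocks.get(0).length];
        for (byte[] block : blocks) {
            for (int i = 0; i < parity.length; i++) {
                parity[i] ^= block[i];
            }
        }
        return parity;
    }

    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(4);
        try {
            List<Callable<byte[]>> reads = new ArrayList<>();
            reads.add(() -> new byte[] {1, 2, 3});
            reads.add(() -> new byte[] {4, 5, 6});
            reads.add(() -> new byte[] {7, 8, 9});

            // Computing a parity block: read all source blocks in parallel.
            byte[] parity = xorParity(readStripe(reads, pool));

            // Generating a fixed block: read the parity block and the good
            // blocks in parallel, then XOR them to recover the missing one.
            List<Callable<byte[]>> recoveryReads = new ArrayList<>();
            recoveryReads.add(() -> parity);
            recoveryReads.add(reads.get(0));
            recoveryReads.add(reads.get(2));
            byte[] fixed = xorParity(readStripe(recoveryReads, pool));
            System.out.println(java.util.Arrays.toString(fixed));
        } finally {
            pool.shutdown();
        }
    }
}
```

With sequential reads, the latency of an operation is roughly the sum of the individual block reads. With a pool at least as large as the stripe, it is closer to the slowest single read, which is the latency reduction the issue is after.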
[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Scott Chen updated MAPREDUCE-2267:
----------------------------------

    Hadoop Flags: [Reviewed]
          Status: Patch Available  (was: Open)
[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ramkumar Vadali updated MAPREDUCE-2267:
---------------------------------------

    Attachment: MAPREDUCE-2267.4.patch

Attached diff from top level
[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ramkumar Vadali updated MAPREDUCE-2267:
---------------------------------------

    Attachment: MAPREDUCE-2267.3.patch

Fixed RaidShell to not invoke the recoverFile RPC but use DistributedRaidFileSystem to read a corrupt file
[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ramkumar Vadali updated MAPREDUCE-2267:
---------------------------------------

    Attachment: MAPREDUCE-2267.2.patch

review at https://reviews.apache.org/r/326/
[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ramkumar Vadali updated MAPREDUCE-2267:
---------------------------------------

    Attachment: MAPREDUCE-2267.1.patch

Fixing failures found during tests.
[jira] Updated: (MAPREDUCE-2267) Parallelize reading of blocks within a stripe
[ https://issues.apache.org/jira/browse/MAPREDUCE-2267?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ramkumar Vadali updated MAPREDUCE-2267:
---------------------------------------

    Attachment: MAPREDUCE-2267.patch