[ https://issues.apache.org/jira/browse/PDFBOX-1893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
John Hewson updated PDFBOX-1893: -------------------------------- Description: I'm currently working on this, so I wanted to open an issue to let everyone know. Color spaces need to be refactored in 2.0.0. Tilman noticed slowness in PDFBOX-1851 due to using ICC profiles and calling ColorSpace#toRGB for every pixel. For example, the file from PDFBOX-1851 went from rendering in 4 seconds to taking over 60 seconds. The solution is to use ColorConvertOp to convert an entire BufferedImage in one go, taking advantage of AWT's native color management module. Color conversions done this way are almost instantaneous, even for large images. The current design of color spaces within PDFBox depends upon conversions being done on a per-pixel basis, so a significant refactoring is needed in order to convert images using ColorConvertOp without having to resort to per-pixel calls in cases such as a Separation color space which uses a CMYK alternate color space via a tint-transform. The color space handling code is also tightly coupled to image handling. The various classes which read images each have their own color handling code which rely on per-pixel conversions. For this reason any color space refactoring must also included a significant refactoring of image handling code. This is an opportunity to refactor all color handling so that it is encapsulated within the color space classes, allowing downstream users to call toRGB(float[]) or toRGB(BufferedImage) and not need to worry about tint transforms and the like. was: I'm currently working on this, so I wanted to open an issue to let everyone know. Color spaces need to be refactored in 2.0.0. Tilman noticed slowness in PDFBOX-1851 due to using ICC profiles and calling {{ColorSpace#toRGB}} for every pixel. For example, the file from PDFBOX-1851 went from rendering in 4 seconds to taking over 60 seconds. The solution is to use {{ColorConvertOp}} to convert an entire {{BufferedImage}} in one go, taking advantage of AWT's native color management module. Color conversions done this way are almost instantaneous, even for large images. The current design of color spaces within PDFBox depends upon conversions being done on a per-pixel basis, so a significant refactoring is needed in order to convert images using {{ColorConvertOp}} without having to resort to per-pixel calls in cases such as a Separation color space which uses a CMYK alternate color space via a tint-transform. The color space handling code is also tightly coupled to image handling. The various classes which read images each have their own color handling code which rely on per-pixel conversions. For this reason any color space refactoring must also included a significant refactoring of image handling code. This is an opportunity to refactor all color handling so that it is encapsulated within the color space classes, allowing downstream users to call {{toRGB(float[])}} or {{toRGB(BufferedImage)}} and not need to worry about tint transforms and the like. > Refactor color spaces > --------------------- > > Key: PDFBOX-1893 > URL: https://issues.apache.org/jira/browse/PDFBOX-1893 > Project: PDFBox > Issue Type: Improvement > Components: PDModel, Utilities > Affects Versions: 2.0.0 > Reporter: John Hewson > Assignee: John Hewson > Labels: color > > I'm currently working on this, so I wanted to open an issue to let everyone > know. > Color spaces need to be refactored in 2.0.0. Tilman noticed slowness in > PDFBOX-1851 due to using ICC profiles and calling ColorSpace#toRGB for every > pixel. For example, the file from PDFBOX-1851 went from rendering in 4 > seconds to taking over 60 seconds. > The solution is to use ColorConvertOp to convert an entire BufferedImage in > one go, taking advantage of AWT's native color management module. Color > conversions done this way are almost instantaneous, even for large images. > The current design of color spaces within PDFBox depends upon conversions > being done on a per-pixel basis, so a significant refactoring is needed in > order to convert images using ColorConvertOp without having to resort to > per-pixel calls in cases such as a Separation color space which uses a CMYK > alternate color space via a tint-transform. > The color space handling code is also tightly coupled to image handling. The > various classes which read images each have their own color handling code > which rely on per-pixel conversions. For this reason any color space > refactoring must also included a significant refactoring of image handling > code. This is an opportunity to refactor all color handling so that it is > encapsulated within the color space classes, allowing downstream users to > call toRGB(float[]) or toRGB(BufferedImage) and not need to worry about tint > transforms and the like. -- This message was sent by Atlassian JIRA (v6.1.5#6160)