Is the difference in threshold when there are a small number of bins explainable via the one bin difference? If it isn't then that is cause for concern. Otherwise it is just another side effect of the original difference. Small numbers of bins can easily break the assumptions in many of the histogram based methods, as can data with features like large numbers of zeros or maximum values.
On Thu, Jul 4, 2013 at 6:17 AM, Padfield, Dirk R (GE Global Research) < [email protected]> wrote: > Exactly. It seems crazy to use more than 256 bins in that case. But the > goal of that test was simply timing, and the best way to increase the > computation time is to increase the number of bins. > > The real takeaway message is: either implementation is equally fast for > reasonably-sized inputs, so there is no timing-based concern for replacing > one for the other. > > Dirk > > > ________________________________________ > From: Johnson, Hans J [[email protected]] > Sent: Wednesday, July 03, 2013 3:04 PM > To: Padfield, Dirk R (GE Global Research); Matt McCormick > Cc: Bradley Lowekamp; <[email protected]> Developers; > [email protected]; Gaëtan Lehmann > Subject: Re: [Insight-developers] OtsuThresholdCalculator versus > OtsuMultipleThresholdsCalculator > > Dirk, > > With 10,000,000 bins, and an unsigned char data set this seems suspicious. > Are there real needs for such a large number of bins? > > I'd hate to spend a lot of time doing optimizations for 10,000,000 bins if > there is no real-world use case for it. > > Hans > > > -----Original Message----- > From: <Padfield>, "Dirk R (GE Global Research)" > <[email protected]> > Date: Wednesday, July 3, 2013 1:27 PM > To: Matt McCormick <[email protected]>, Hans Johnson > <[email protected]> > Cc: Bradley Lowekamp <[email protected]>, ITK > <[email protected]>, "[email protected]" > <[email protected]>, Gaëtan Lehmann <[email protected]> > Subject: RE: [Insight-developers] OtsuThresholdCalculator versus > OtsuMultipleThresholdsCalculator > > Hi Folks, > > I agree as well that we should merge the two implementations and provide > an update on the migration guide. > > I looked in a little more detail at the two implementations. If I run > both algorithms for a large range of numberOfHistogramBins, they produce > very different results when the number is small, but very similar results > when the number of bins is large. This is disturbing in itself, so I am > looking into the implementations to see why this happens. > > In terms of the timing, they are both very fast for normal bin sizes. But > when I push the numberOfHistogramBins up to 1,000,000 and 10,000,000, I > get some differences: > > numberOfHistogramBins = 1,000,000: > Otsu = 5 sec > OtsuMultiple = 0.5 sec > > numberOfHistogramBins = 10,000,000: > Otsu = 45.80 sec > OtsuMultiple = 4.58 sec > > Thus, when the numberOfHistogramBins is very large, the difference is > significant: the OtsuMultiple is about 10 times faster. This is > encouraging since the OtsuMultiple is the more general implementation. > > All of these tests are on the cthead1.png image that was used for the > ctest. > > My plan is to carefully compare the implementations to see how to make > them report the same results. And then I will merge them so that the Otsu > is just a shell that inherits from OtsuMultiple. > > Thanks, > Dirk > > > ________________________________________ > From: Matt McCormick [[email protected]] > Sent: Tuesday, July 02, 2013 3:44 PM > To: Johnson, Hans J > Cc: Bradley Lowekamp; Padfield, Dirk R (GE Global Research); > <[email protected]> Developers; [email protected]; Gaëtan > Lehmann > Subject: Re: [Insight-developers] OtsuThresholdCalculator versus > OtsuMultipleThresholdsCalculator > > I concur with Brad and Hans. > > Adding Gaëtan in CC. > > Thanks, > Matt > > On Tue, Jul 2, 2013 at 5:08 PM, Johnson, Hans J <[email protected]> > wrote: > > I agree with brad. The two should produce the same results, even tough > >it > > may introduce a different numerical result in the > > OtsuMultipleThresholdCalculator. > > > > -----Original Message----- > > From: Bradley Lowekamp <[email protected]> > > Date: Tuesday, July 2, 2013 12:01 PM > > To: "Padfield, Dirk R (GE Global Research)" <[email protected]> > > Cc: ITK <[email protected]>, "[email protected]" > > <[email protected]> > > Subject: Re: [Insight-developers] OtsuThresholdCalculator > > versus OtsuMultipleThresholdsCalculator > > > > Dirk, > > > > I would vote to have the two produce the same results, and create a > >little > > migration guide which notes the change. > > > > Regarding, the refactoring, is there any difference in the algorithm > > complexity of the two? Any measurements of the performance difference > > between the two? > > > > Brad > > > > On Jul 2, 2013, at 11:04 AM, "Padfield, Dirk R (GE Global Research)" > > <[email protected]> wrote: > > > >> Hi Richard, > >> > >> Thank you for your response and insight. If anything would need to be > >>changed, I would also lean towards changing the OtsuMultiple because I > >>think we shouldn't change Otsu since I am sure many more people are using > >>the Otsu because it is a standard thresholding algorithm. I will do some > >>comparisons of the two against other implementations to see what I find. > >> > >> But the question still remains: is it okay to change the OtsuMultiple to > >>give the same output as Otsu for one threshold? I am thinking in terms > >>of those people who use OtsuMultiple whose results will then be slightly > >>different. Here are the advantages and disadvantages for keeping things > >>the same: > >> > >> Advantages: everyone's code still works as it did before > >> Disadvantages: the two implementations are inconsistent with each other > >>even though they are the same algorithm. The overlapping code cannot be > >>merged (inheritance). And future enhancements will need to be made in > >>both places. > >> > >> My vote is: change OtsuMultiple to be consistent with Otsu. > >> > >> What do others think? > >> > >> Dirk > >> > >> > >> ________________________________ > >> From: Richard Beare [[email protected]] > >> Sent: Monday, July 01, 2013 5:40 PM > >> To: Bradley Lowekamp > >> Cc: Padfield, Dirk R (GE Global Research); <[email protected]> > >>Developers > >> Subject: Re: [Insight-developers] OtsuThresholdCalculator versus > >>OtsuMultipleThresholdsCalculator > >> > >> Hi, > >> I think the order was - I introduced new filters copied from ImageJ, > >>then Gaetan started refactoring to use the histogram framework. We both > >>did some work to make that correspond to old versions. I don't remember > >>working on the MultipleThreshold version, but the code does look > >>similar, so perhaps it was done somewhere along the way - will need to > >>check the logs. > >> > >> I'm pretty sure that Otsu was producing the same results that it used to > >>- I didn't compare to other implementations. Thus, if the original Otsu > >>was correct then the current one should be too, which would suggest that > >>the MultipleThresholds version should probably change. > >> > >> Not sure when I'll get a chance to look at this in detail. > >> > >> I don't have a current email for Gaetan to CC for confirmation. > >> > >> > >> On Mon, Jul 1, 2013 at 11:02 PM, Bradley Lowekamp > >><[email protected]<mailto:[email protected]>> wrote: > >> Dirk, > >> > >> I believe Richard Beare did the refactoring of the thresholding > >>framework an Insight Journal Article. He will likely know why it is this > >>way better than anyone else. > >> > >> You also didn't say which implementation is correct. > >> > >> Brad > >> > >> > >> On Jun 30, 2013, at 10:03 PM, "Padfield, Dirk R (GE Global Research)" > >><[email protected]<mailto:[email protected]>> wrote: > >> > >>> Hi ITK Developers, > >>> > >>> I was just looking through the OtsuThresholdCalculator and > >>>OtsuMultipleThresholdsCalculator to see whether I could refactor them so > >>>that the Otsu inherits from the OtsuMultiple since the latter is a more > >>>general case of the former. Currently, the code for these two filters > >>>is totally different resulting in significant code duplication and a > >>>need to keep both filters in sync. > >>> > >>> As a first step, I wrote a CMake test to check that the output of the > >>>OtsuMultiple with 1 threshold is the same as the output of the Otsu. > >>>Unfortunately, they are not! The two filters output thresholds that are > >>>different by 1 histogram bin! This can be a quite extreme difference > >>>when the numberOfHistogramBins is low, and it leads to different > >>>thresholds even when the numberOfHistogramBins is reasonably high (say > >>>256). I tracked it down to this code in the Calculators: > >>> > >>> The relevant code from Otsu: > >>> const double tolerance = 0.00001; > >>> if ( (varBetween - tolerance) > maxVarBetween ) > >>> { > >>> maxVarBetween = varBetween; > >>> maxBinNumber = j; > >>> } > >>> } > >>> this->GetOutput()->Set( static_cast<OutputType>( > >>>histogram->GetMeasurement( maxBinNumber + 1, 0 ) ) ); > >>> > >>> The relevant code from MultipleOtsu: > >>> if ( varBetween > maxVarBetween ) > >>> { > >>> maxVarBetween = varBetween; > >>> maxVarThresholdIndexes = thresholdIndexes; > >>> } > >>> } > >>> for ( j = 0; j < m_NumberOfThresholds; j++ ) > >>> { > >>> m_Output[j] = histogram->GetBinMax(0, maxVarThresholdIndexes[j]); > >>> } > >>> > >>> The difference is that the Otsu adds one to the computed threshold > >>>whereas the MultipleOtsu does not. This is problematic because users > >>>would expect them to give the same result. > >>> > >>> My question is: how should we proceed? If we change one or the other, > >>>people's code that use the changed one will give slightly different > >>>answers. If we don't change them, the two filters will give different > >>>outputs for the same input, and it will not be possible to refactor them > >>>to share code. > >>> > >>> What are your thoughts? > >>> > >>> Thanks, > >>> Dirk > >>> _______________________________________________ > >>> Powered by www.kitware.com<http://www.kitware.com> > >>> > >>> Visit other Kitware open-source projects at > >>> http://www.kitware.com/opensource/opensource.html > >>> > >>> Kitware offers ITK Training Courses, for more information visit: > >>> http://kitware.com/products/protraining.php > >>> > >>> Please keep messages on-topic and check the ITK FAQ at: > >>> http://www.itk.org/Wiki/ITK_FAQ > >>> > >>> Follow this link to subscribe/unsubscribe: > >>> http://www.itk.org/mailman/listinfo/insight-developers > >> > >> > > > > _______________________________________________ > > Powered by www.kitware.com > > > > Visit other Kitware open-source projects at > > http://www.kitware.com/opensource/opensource.html > > > > Kitware offers ITK Training Courses, for more information visit: > > http://kitware.com/products/protraining.php > > > > Please keep messages on-topic and check the ITK FAQ at: > > http://www.itk.org/Wiki/ITK_FAQ > > > > Follow this link to subscribe/unsubscribe: > > http://www.itk.org/mailman/listinfo/insight-developers > > > > > > > > ________________________________ > > Notice: This UI Health Care e-mail (including attachments) is covered by > >the Electronic Communications Privacy Act, 18 U.S.C. 2510-2521, is > >confidential and may be legally privileged. If you are not the intended > >recipient, you are hereby notified that any retention, dissemination, > >distribution, or copying of this communication is strictly prohibited. > >Please reply to the sender that you have received the message in error, > >then delete it. Thank you. > > ________________________________ > > _______________________________________________ > > Powered by www.kitware.com > > > > Visit other Kitware open-source projects at > > http://www.kitware.com/opensource/opensource.html > > > > Kitware offers ITK Training Courses, for more information visit: > > http://kitware.com/products/protraining.php > > > > Please keep messages on-topic and check the ITK FAQ at: > > http://www.itk.org/Wiki/ITK_FAQ > > > > Follow this link to subscribe/unsubscribe: > > http://www.itk.org/mailman/listinfo/insight-developers > > > > ________________________________ > Notice: This UI Health Care e-mail (including attachments) is covered by > the Electronic Communications Privacy Act, 18 U.S.C. 2510-2521, is > confidential and may be legally privileged. If you are not the intended > recipient, you are hereby notified that any retention, dissemination, > distribution, or copying of this communication is strictly prohibited. > Please reply to the sender that you have received the message in error, > then delete it. Thank you. > ________________________________ >
_______________________________________________ Powered by www.kitware.com Visit other Kitware open-source projects at http://www.kitware.com/opensource/opensource.html Kitware offers ITK Training Courses, for more information visit: http://kitware.com/products/protraining.php Please keep messages on-topic and check the ITK FAQ at: http://www.itk.org/Wiki/ITK_FAQ Follow this link to subscribe/unsubscribe: http://www.itk.org/mailman/listinfo/insight-developers
