Corection typo: Should read 'Whiten intra class scatter' "Mark Harrison" <[EMAIL PROTECTED]> wrote in message news:FIif8.16518$[EMAIL PROTECTED].; > Good places to start: > > Optimal feature extractors, that's better than PCA because you whiten your > inter class scatter and so put all inter class comparisons on the same > level. The good thing is this will also reduce your feature vector > dimensionality to c-1 (where c is # classes). PCA will not do this. > > Check the stats of each class, is it Gaussian or known pdf? Apply > parameteric classifier if so. > > However you are lucky if you get good classification after this, so you will > probably need non linear, non parametric classifiers. Try K nearest > neighobour, but that might take the age of the Universe so use a condensing > algorithm first to get a smaller representative set. > > Matlab is what I use for coding, there are a lot of free toolboxes around. > Mostly I write my own though. > > Best wishes > > Andrew > > > "Rishabh Gupta" <[EMAIL PROTECTED]> wrote in message > news:a4eje9$ip8$[EMAIL PROTECTED].; > > Hi All, > > I'm a research student at the Department Of Electronics, University Of > > York, UK. I'm working a project related to music analysis and > > classification. I am at the stage where I perform some analysis on music > > files (currently only in MIDI format) and extract about 500 variables that > > are related to music properties like pitch, rhythm, polyphony and volume. > I > > am performing basic analysis like mean and standard deviation but then I > > also perform more elaborate analysis like measuring complexity of melody > and > > rhythm. > > > > The aim is that the variables obtained can be used to perform a number of > > different operations. > > - The variables can be used to classify / categorise each piece of > > music, on its own, in terms of some meta classifier (e.g. rock, pop, > > classical). > > - The variables can be used to perform comparison between two files. A > > variable from one music file can be compared to the equivalent variable in > > the other music file. By comparing all the variables in one file with the > > equivalent variable in the other file, an overall similarity measurement > can > > be obtained. > > > > The next stage is to test the ability of the of the variables obtained to > > perform the classification / comparison. I need to identify variables that > > are redundant (redundant in the sense of 'they do not provide any > > information' and 'they provide the same information as the other > variable') > > so that they can be removed and I need to identify variables that are > > distinguishing (provide the most amount of information). > > > > My Basic Questions Are: > > - What are the best statistical techniques / methods that should be > > applied here. E.g. I have looked at Principal Component Analysis; this > would > > be a good method to remove the redundant variables and hence reduce some > the > > amount of data that needs to be processed. Can anyone suggest any other > > sensible statistical anaysis methods? > > - What are the ideal tools / software to perform the clustering / > > classification. I have access to SPSS software but I have never used it > > before and am not really sure how to apply it or whether it is any good > when > > dealing with 100s of variables. > > > > So far I have been analysing each variable on its own 'by eye' by plotting > > the mean and sd for all music files. However this approach is not feasible > > in the long term since I am dealing with such a large number of variables. > > In addition, by looking at each variable on its own, I do not find > clusters > > / patterns that are only visible through multivariate analysis. If anyone > > can recommend a better approach I would be greatly appreciated. > > > > Any help or suggestion that can be offered will be greatly appreciated. > > > > Many Thanks! > > > > Rishabh Gupta > > > > > >
================================================================= Instructions for joining and leaving this list, remarks about the problem of INAPPROPRIATE MESSAGES, and archives are available at http://jse.stat.ncsu.edu/ =================================================================