Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-22 Thread Paolo Losi
Hi Albert

On Fri, Mar 22, 2013 at 6:33 PM, Albert Kottke wrote:
> Consider the following profiles:
>
> #   A   B   C   D
> 1  10   9   3   2
> 2   4   5   4   5
> 3   6   5   6   7
>
> I have removed thickness and just used layer number for simplicity. The
> desired behavior is that profiles A and B are grouped

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-22 Thread Albert Kottke
Consider the following profiles:

#   A   B   C   D
1  10   9   3   2
2   4   5   4   5
3   6   5   6   7

I have removed thickness and just used layer number for simplicity. The desired behavior is that profiles A and B are grouped together because they start high, decrease, and then slightly increase. Profiles C
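A minimal sketch of one way to encode the "start high, decrease, then increase" shape described above: use the change in value between adjacent layers as the features, rather than the raw values. This is an assumption about what a suitable feature might be, not something proposed in the thread; the helper name `layer_differences` is hypothetical. The numbers are the four toy profiles from the message.

```python
import math

# Toy profiles from the message: one value per layer (layers 1..3).
profiles = {
    "A": [10, 4, 6],
    "B": [9, 5, 5],
    "C": [3, 4, 6],
    "D": [2, 5, 7],
}


def layer_differences(values):
    """Return the change in value from each layer to the next one down."""
    return [lower - upper for upper, lower in zip(values, values[1:])]


# One feature vector per profile; each has n_layers - 1 entries.
features = {name: layer_differences(v) for name, v in profiles.items()}


def distance(name_1, name_2):
    """Euclidean distance between the difference features of two profiles."""
    return math.dist(features[name_1], features[name_2])


for pair in [("A", "B"), ("A", "C"), ("A", "D"), ("C", "D")]:
    print(pair, round(distance(*pair), 2))
```

With these features, A and B sit closer to each other than either does to C or D, which is the grouping asked for. Stacking the feature vectors row by row would give an array of shape (n_samples, n_features) that a clustering estimator could consume.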

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-22 Thread Lars Buitinck
2013/3/22 Albert Kottke:
> My biggest question is forming the data into the X matrix (n_samples,
> n_features). The approach you describe would cluster based on thickness and
> velocity without consideration of the relationship between adjacent layers.
> Initially, I want to try to cluster based o

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-22 Thread Albert Kottke
The NaNs at the base of the profile imply that the velocity in that layer continues on for some unspecified thickness, which I can handle using a couple of different approaches. I am not too concerned about that. My biggest question is forming the data into the X matrix (n_samples, n_features).
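A rough sketch of one possible way to form X: treat each profile as a step function of depth and sample it on a fixed depth grid, so every profile yields the same number of features regardless of how many layers it has. This is an assumption, not a method endorsed in the thread. The function name `profile_to_row`, the depth grid, the half-space velocity of the first profile, and the whole second profile are made up for illustration; only the first four layers of the first profile come from the data posted in the thread.

```python
import numpy as np


def profile_to_row(thickness, velocity, depth_grid):
    """Sample a layered (step-function) velocity profile at fixed depths.

    thickness lists the finite layers only. The last entry of velocity is
    the bottom layer of unspecified thickness (the NaN row in the data),
    so len(velocity) == len(thickness) + 1.
    """
    bottoms = np.cumsum(thickness)  # depth to the base of each finite layer
    # Index of the layer containing each depth. A depth exactly on an
    # interface is assigned to the layer below it; depths below the last
    # interface fall in the bottom layer.
    idx = np.searchsorted(bottoms, depth_grid, side="right")
    return np.asarray(velocity, dtype=float)[idx]


depth_grid = np.array([1.0, 5.0, 15.0, 50.0, 100.0])  # metres, hypothetical

rows = [
    # Vs of layers 1-4 as posted; the 1800 m/s half-space is hypothetical.
    profile_to_row([2, 8, 8, 52], [180, 700, 1150, 1720, 1800], depth_grid),
    # Entirely hypothetical second profile with fewer layers.
    profile_to_row([5, 20], [250, 900, 2000], depth_grid),
]
X = np.vstack(rows)  # shape (n_samples, n_features)
print(X.shape)
print(X)
```

An X built this way could then be passed to a clustering estimator such as `sklearn.cluster.KMeans`. The grid spacing controls how much weight shallow versus deep layers receive, so it is a modelling choice rather than a detail.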

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-22 Thread Lars Buitinck
2013/3/22 Albert Kottke:
> Here is the data that I would be working with:
>
> No  Thickness  Depth   Vp       Vs
>     (m)        (m)     (m/s)    (m/s)
> 1,   2.00,     2.00,   480.00,  180.00
> 2,   8.00,    10.00,  2320.00,  700.00
> 3,   8.00,    18.00,  2980.00, 1150.00
> 4,  52.00,    70.00,  298

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-22 Thread Albert Kottke
Here is the data that I would be working with:

No  Thickness  Depth   Vp       Vs
    (m)        (m)     (m/s)    (m/s)
1,   2.00,     2.00,   480.00,  180.00
2,   8.00,    10.00,  2320.00,  700.00
3,   8.00,    18.00,  2980.00, 1150.00
4,  52.00,    70.00,  2980.00, 1720.00
5,   -,        -,     3120.0

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-15 Thread Albert Kottke
Attached is a figure showing a collection of velocity profiles across a region. The goal would be to group each of these curves into groups with similar characteristics.

My idea is to group the entire profile for two reasons:
1) The goal is to randomly generate similar profiles.
2) The layeri

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Ronnie Ghose
I think anything is ok?

On Thu, Mar 14, 2013 at 5:33 PM, Albert Kottke wrote:
> Sure. It might take me a little time to put together. I could do a
> collection of CSV files, or just do a gzipped json file of a Python
> dict.
>
> Albert
>
> On Thu, Mar 14, 2013 at 2:29 PM, Ronnie Ghose
> w

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Albert Kottke
Sure. It might take me a little time to put together. I could do a collection of CSV files, or just do a gzipped json file of a Python dict.

Albert

On Thu, Mar 14, 2013 at 2:29 PM, Ronnie Ghose wrote:
> Could you release a part of your data? / Similar data?
>
>
> On Thu, Mar 14, 2013 at 5:26 PM

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Ronnie Ghose
Could you release a part of your data? / Similar data?

On Thu, Mar 14, 2013 at 5:26 PM, Albert Kottke wrote:
> I sent this email earlier, but that attachment exceeded the attachment
> limit so I am linking the attachment.
>
> http://i.imgur.com/dHida3t.png
>
> Attached is a figure showing a coll

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Albert Kottke
I sent this email earlier, but that attachment exceeded the attachment limit so I am linking the attachment.

http://i.imgur.com/dHida3t.png

Attached is a figure showing a collection of velocity profiles across a region. The goal would be to group each of these curves into groups with similar ch

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Ronnie Ghose
What would be wrong with clustering via the thickness and velocity of each layer as features?

On Mar 14, 2013 3:31 PM, "Albert Kottke" wrote:
> I am a novice at machine learning, so pardon my ignorance.
>
> I have ~3000 velocity profiles, which consist of multiple layers defined
> by a thickness

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Leon Palafox
Hi there

What does your data set look like? Each time you take a measurement, what kind of information do you have available?

Leon

On Thu, Mar 14, 2013 at 12:30 PM, Albert Kottke wrote:
> I am a novice at machine learning, so pardon my ignorance.
>
> I have ~3000 velocity profiles, which

Re: [Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Ronnie Ghose
Please elaborate. It sounds like two features as of now.

On Mar 14, 2013 3:31 PM, "Albert Kottke" wrote:
> I am a novice at machine learning, so pardon my ignorance.
>
> I have ~3000 velocity profiles, which consist of multiple layers defined
> by a thickness and velocity. My goal is to cluster these profile

[Scikit-learn-general] Machine learning on 2D problems.

2013-03-14 Thread Albert Kottke
I am a novice at machine learning, so pardon my ignorance.

I have ~3000 velocity profiles, which consist of multiple layers defined by a thickness and velocity. My goal is to cluster these profiles into ~10 groups with similar characteristics. All of the examples that I have seen on the scikits-lea