Raghuveer,

I am more confused than before.

You say that the destination is on the second line.  That seems to imply
that your data has more than one line per data point.  Is this so?  That
seems to contradict your previous comments.



On Wed, Mar 25, 2015 at 10:20 PM, Raghuveer <alwaysra...@yahoo.com.invalid>
wrote:

> Thanks for the reply.
> IPs means:
> ip, timestamp, bytes_transferred192.168.5.34, 1345456765434,
> 456192.168.4.24, 1345456765444, 34192.168.5.34, 1345456765454, 2355...
> Yesi have a list of IP addresses and i have extracted data from binary
> files and loaded them to HDFS in text format.
>   Destination IP address is not implicit infact its in the second row and
> is a server.
> Kindly suggest how i can do the kmeans clustering wrt timestamp or is
> there a better way?
> Regards,Raghuveer
>
>
>
>      On Thursday, March 26, 2015 6:34 AM, Ted Dunning <
> ted.dunn...@gmail.com> wrote:
>
>
>  This is an old question that I just dredged up in my email.
>
> There is still a question about your format here.  When you say "IPs" do
> you mean that you have a list of IP addresses?
>
> Or is this a server web-log?  Does that mean that the destination IP is
> implicit.  If so, you might be able to see a weak signal due to time
> proximity of different IP addresses, but I can't see that you would see
> much else.  Time proximity might give you a hint about wide-spread attacks.
>
>
>
> On Wed, Feb 18, 2015 at 6:49 AM, Raghuveer <alwaysra...@yahoo.com> wrote:
>
> >
> > Hi,
> >
> > I was going through mahout ppts online and came accross your email ID. I
> > have few issues when i want to analyse my dataset.
> >
> > i am trying to find how i can make use of my dataset to present some
> > relations. I have a dataset of the sort
> >
> > IPs,timestamp,bytes_tranferred
> >
> > what are the different relationships i can derive from this set so that i
> > can present some meaningful values using mahout. Currently am planning to
> > use this set to represent which client (in IPs column) had more traffic
> for
> > a given time. So i will have to group IPs together i guess. Are there any
> > better ideas and how can i do it using JAVA code It would be really
> helpful
> > if you can show me a sample for this issue. Kindly suggest.
> >
> > Thanks,
> > Raghuveer
> >
> >  On Tuesday, February 17, 2015 12:24 AM, Ted Dunning <
> > ted.dunn...@gmail.com> wrote:
> >
> >
> >
> > Please take questions like this to the Mahout mailing list.
> >
> > I really prefer to answer these questions in public.
> >
> > On Mon, Feb 16, 2015 at 3:51 AM, Raghuveer <alwaysra...@yahoo.com>
> wrote:
> >
> >
> >
> >
> >
> > Hi,
> >
> > I was going through mahout ppts online and came accross your email ID. I
> > have few issues when i want to analyse my dataset.
> >
> > i am trying to find how i can make use of my dataset to present some
> > relations. I have a dataset of the sort
> >
> > IPs,timestamp,bytes_tranferred
> >
> > what are the different relationships i can derive from this set so that i
> > can present some meaningful values using mahout. Currently am planning to
> > use this set to represent which client (in IPs column) had more traffic
> for
> > a given time. So i will have to group IPs together i guess. Are there any
> > better ideas and how can i do it using JAVA code It would be really
> helpful
> > if you can show me a sample for this issue. Kindly suggest.
> >
> > Thanks,
> > Raghuveer
> >
> >
> >
> >
> >
> >
> >
>
>
>

Reply via email to