Thanks for the reply.
IPs means:
ip, timestamp, bytes_transferred192.168.5.34, 1345456765434, 456192.168.4.24, 
1345456765444, 34192.168.5.34, 1345456765454, 2355...
Yesi have a list of IP addresses and i have extracted data from binary files 
and loaded them to HDFS in text format.
  Destination IP address is not implicit infact its in the second row and is a 
server.
Kindly suggest how i can do the kmeans clustering wrt timestamp or is there a 
better way?
Regards,Raghuveer



     On Thursday, March 26, 2015 6:34 AM, Ted Dunning <ted.dunn...@gmail.com> 
wrote:
   

 This is an old question that I just dredged up in my email.

There is still a question about your format here.  When you say "IPs" do
you mean that you have a list of IP addresses?

Or is this a server web-log?  Does that mean that the destination IP is
implicit.  If so, you might be able to see a weak signal due to time
proximity of different IP addresses, but I can't see that you would see
much else.  Time proximity might give you a hint about wide-spread attacks.



On Wed, Feb 18, 2015 at 6:49 AM, Raghuveer <alwaysra...@yahoo.com> wrote:

>
> Hi,
>
> I was going through mahout ppts online and came accross your email ID. I
> have few issues when i want to analyse my dataset.
>
> i am trying to find how i can make use of my dataset to present some
> relations. I have a dataset of the sort
>
> IPs,timestamp,bytes_tranferred
>
> what are the different relationships i can derive from this set so that i
> can present some meaningful values using mahout. Currently am planning to
> use this set to represent which client (in IPs column) had more traffic for
> a given time. So i will have to group IPs together i guess. Are there any
> better ideas and how can i do it using JAVA code It would be really helpful
> if you can show me a sample for this issue. Kindly suggest.
>
> Thanks,
> Raghuveer
>
>  On Tuesday, February 17, 2015 12:24 AM, Ted Dunning <
> ted.dunn...@gmail.com> wrote:
>
>
>
> Please take questions like this to the Mahout mailing list.
>
> I really prefer to answer these questions in public.
>
> On Mon, Feb 16, 2015 at 3:51 AM, Raghuveer <alwaysra...@yahoo.com> wrote:
>
>
>
>
>
> Hi,
>
> I was going through mahout ppts online and came accross your email ID. I
> have few issues when i want to analyse my dataset.
>
> i am trying to find how i can make use of my dataset to present some
> relations. I have a dataset of the sort
>
> IPs,timestamp,bytes_tranferred
>
> what are the different relationships i can derive from this set so that i
> can present some meaningful values using mahout. Currently am planning to
> use this set to represent which client (in IPs column) had more traffic for
> a given time. So i will have to group IPs together i guess. Are there any
> better ideas and how can i do it using JAVA code It would be really helpful
> if you can show me a sample for this issue. Kindly suggest.
>
> Thanks,
> Raghuveer
>
>
>
>
>
>
>


  

Reply via email to