Hi folks,

This is an old issue, but Marc is definitely right. I don't know where the
problem is, but my infrastructure is poor and I am still getting some broken
statuses.
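
For anyone who wants to check Marc's single-bit theory against their own logs,
here is a minimal Python sketch. It XORs each corrupted character with the
character I assume was intended ("Thu" for the created_at values and "false"
for "fanse", both taken from the examples quoted further down). The function
name flipped_bits is only for illustration.

```python
def flipped_bits(expected, observed):
    """Return (expected_char, observed_char, xor_mask) for each differing position."""
    if len(expected) != len(observed):
        raise ValueError("strings must have the same length")
    return [
        (e, o, ord(e) ^ ord(o))
        for e, o in zip(expected, observed)
        if e != o
    ]


# Corrupted fragments from this thread, paired with the text I assume was intended.
samples = [("Thu", "Thw"), ("Thu", "Tju"), ("Thu", "Vhu"), ("false", "fanse")]

for good, bad in samples:
    for e, o, mask in flipped_bits(good, bad):
        print(f"{good!r} -> {bad!r}: {e!r} became {o!r}, XOR mask 0x{mask:02X}")
```

Every pair above yields the mask 0x02, which matches the pattern Marc
describes.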

Would it be too heavy for the streaming server (and the other API servers) to
put a hash (md5) on each status message, so that the client can verify its
integrity?
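
To make the idea concrete, here is a rough Python sketch of what I mean. The
Streaming API does not send any such hash today, so the framing (hex MD5, a
space, then the JSON payload) and both function names are purely hypothetical.

```python
import hashlib
import json


def wrap_with_digest(payload):
    """Hypothetical sender side: prefix the payload with its hex MD5 and a space."""
    return hashlib.md5(payload).hexdigest().encode("ascii") + b" " + payload


def verify_and_parse(line):
    """Hypothetical receiver side: return the parsed status, or None if corrupted."""
    digest, sep, payload = line.partition(b" ")
    if not sep or hashlib.md5(payload).hexdigest().encode("ascii") != digest:
        return None  # digest missing or mismatch: drop or re-request the status
    return json.loads(payload)


good = wrap_with_digest(b'{"show_all_inline_media":false}')
bad = good.replace(b"false", b"fanse")  # simulate the corruption seen in this thread

print(verify_and_parse(good))
print(verify_and_parse(bad))
```

With a check like this the client could discard a damaged status before it
reaches the JSON parser. MD5 here would only guard against accidental
corruption, not tampering.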

Thanks.

Augusto.

On Fri, Nov 12, 2010 at 10:32 PM, @IDisposable <idisposa...@gmail.com> wrote:

> There's NO QUESTION in my mind that this is a memory/disk/network
> issue. Somewhere in the chain of events leading to your parser,
> someone is dropping bits and tons of ECC/Checksum logic/etc. is
> missing it.  I really really really doubt the feed is corrupt when
> leaving twitter-land... it's getting borked on the way to your parser.
>
> FWIW, I've never seen any of these issues... and I consume a TON of
> tweets daily.  The difference between T and V is one bit in ASCII (hex
> 0x54 versus 0x56), between u and w (0x75 vs. 0x77), between h and j
> (0x68 and 0x6A). We're seeing a consistent flip-to-on of the 0x02
> bit.  In my estimation you've probably got bad RAM... if you've got
> hardware control of this machine, try upping the voltage on the
> Northbridge and DRAM a notch if you can.
>
> Marc
>
> On Nov 12, 6:52 am, Augusto Santos <augu...@gemeos.org> wrote:
> > During this period of broken JSON statuses, I also received wrongly
> > formatted created_at values:
> >
> > 2747941206892544   Thu Nov 11 35:42:14 +0000 2010
> > 2565022072963072   Thw Nov 11 03:35:23 +0000 2010
> > 2562132222672896   Tju Nov 11 03:23:54 +0000 2010
> > 2550619441209344   Thu Nov 11 02:38:0; +0000 2010
> > 2545567930523648   Vhu Nov 11 02:18:05 +0000 2010
> >
> > and so on...
> >
> > It looks like there was only one bad character per status, which messed up
> > everything.
> >
> > Since Thu Nov 11 15:44:37 +0000 2010, I have had no JSON parser errors or
> > corrupt created_at values.
> >
> > Thanks.
> >
> > On Thu, Nov 11, 2010 at 11:51 PM, Augusto Santos <augu...@gemeos.org> wrote:
> > > Hi Taylor,
> >
> > > First, thanks for the answer.
> >
> > > I'm using the Phirehose library for PHP, PHP's native
> > > json_decode($status,TRUE), and, after the JSON decode,
> > > mysql_real_escape_string on the string fields. I see now that my log
> > > routine also applies the MySQL escaping before the query, so these
> > > examples are escaped according to that MySQL procedure.
> >
> > > Here is the number of tweets with this problem. These are the cases where
> > > json_decode failed, so there is no id_str or new_id_str in my $status[]
> > > array; my code then throws an error and logs it together with the JSON
> > > status. I can send you all of these statuses if you want them.
> >
> > > Date        Hour (GMT-2)  Count
> > > 2010-11-11  13               97
> > > 2010-11-11   1              367
> > > 2010-11-11   0              521
> > > 2010-11-10  23              598
> > > 2010-11-10  22              569
> > > 2010-11-10  21              577
> > > 2010-11-10  20              619
> > > 2010-11-10  19              606
> > > 2010-11-10  18              603
> > > 2010-11-10  17              607
> > > 2010-11-10  16              247
> > > 2010-11-10  11                9
> > > 2010-11-09  22                2
> >
> > > Thanks, Augusto.
> >
> > > On Thu, Nov 11, 2010 at 2:52 PM, Taylor Singletary <
> > > taylorsinglet...@twitter.com> wrote:
> >
> > >> Hi Augusto,
> >
> > >> I monitored the sample stream this morning for a few hours for instances
> > >> similar to the JSON examples you've provided below and was unable to see
> > >> the scenario duplicated. What JSON parser are you using? Is there any
> > >> other processing that may have occurred prior to generating your examples
> > >> below? Do you know how your library is handling "escaped" quote values
> > >> like \" ?
> >
> > >> How many of these did you observe?
> >
> > >> Thanks,
> > >> Taylor
> >
> > >> On Thu, Nov 11, 2010 at 5:10 AM, Augusto Santos <augu...@gemeos.org> wrote:
> >
> > >>> I received a lot of broken JSON statuses from the Streaming API.
> >
> > >>> Count of broken JSON statuses by day:
> > >>> 2010-11-09     2
> > >>> 2010-11-10     4435
> > >>> 2010-11-11     888
> >
> > >>> Examples:
> > >>> {"in_reply_to_status_id_str":"2563309119209472","text":"@joi4kitten I
> > >>> have that same
> > >>>
> fear.","truncated":false,"in_reply_to_user_id_str":"16155805","entities":{"
> user_menvions":[{"screen_name":"joi4kitten","indices":[0,11],"name":"joi4ki
> tten","id":16155805,"id_str":"16155805"}],"urls":[],"hashtags":[]},"geo":nu
> ll,"in_reply_to_status_id":2563309119209472,"place":{"country_code":"US","c
> ountry":"The
> > >>> United States of
> > >>>
> America","bounding_box":{"type":"Polygon","coordinates":[[[-76.965351,38.97
> 1109],[-76.909147,38.971109],[-76.909147,39.022114],[-76.965353,39.022114]]
> ]},"place_type":"city","attributes":{},"full_name":"College
> > >>> Park, MD","name":"College Park",*
> > >>> "id":"e4c17912c815124d"."url":"http:\/\/api.twitter.com
> > >>>
> \/1\/geo\/id\/e4c17912c815124d.json"*},"favorited":false,"source":"\u003Ca
> > >>> href=\"http:\/\/mobile.twitter.com\" rel=\"nofollow\"\u003EMobile
> > >>>
> Web\u003C\/a\u003E","contributors":null,"in_reply_to_screen_name":"joi4kitt
> en","coorfinates":null,"retweet_count":null,"in_reply_to_user_id":16155805,
> "created_at":"Thu
> > >>> Nov 11 03:38:52 +0000
> > >>>
> 2010","new_id_str":"2565897919139841","new_id":2565897919139841,"user":{"fo
> llow_request_sent":null,"lang":"en","time_zone":"Eastern
> > >>> Time (US &
> > >>>
> Canada)","screen_name":"kellygo","following":null,"profile_sidebar_border_c
> olor":"0A84A5","profile_background_image_url":"http:\/\/
> > >>> a3.twimg.com\/profile_background_images\/8381831\/twitter_background_with_b
> ear_2.jpg","notifications":null,"description":"I
> > >>> am not Kelly
> > >>>
> Osbourne.","listed_count":12,"profile_background_tile":false,"profile_backg
> round_color":"052743","statuses_count":1559,"location":"College
> > >>> Park,
> > >>>
> Md.","show_all_inline_media":false,"profile_use_background_image":true,"fav
> ourites_count":29,"profile_text_color":"000000","contributors_enabled":fals
> e,"protected":false,"geo_enabled":true,"created_at":"Sun
> > >>> Jul 22 02:12:05 +0000
> 2007","profile_link_color":"0A84A5","name":"Kelly
> > >>> Osborn","friends_count":235,"url":"http:\/\/kellygo.blogspot.com
> > >>> ","profile_image_url":"http:\/\/a1.twimg.com
> > >>>
> \/profile_images\/271727429\/kelly_o__posterized_normal.jpg","id":7632582,"
> id_str":"7632582","verified":false,"utc_offset":-18000,"profile_sidebar_fil
> l_color":"CFE9E2","followers_count":332},"retweeted":false,"id":25658979191
> 39841,"id_str":"2565897919139841"}
> >
> > >>> Here there is a dot where there should be a comma, between id and url.
> >
> > >>> {"in_reply_to_status_id_str":"2546974377119744","text":"@ozdj Ah,
> > >>> suddenly #TweetBurglar makes complete
> > >>>
> sense","truncated":false,"in_reply_to_user_id_str":"15324154","entities":{"
> user_mentions":[{"screen_name":"ozdj","indices":[0,5],"name":"Derek
> > >>>
> Jenkins","id":15324154,"id_str":"15324154"}],"urls":[],"hashtags":[{"text":
> "TweetBurglar","indices":[19,32]}]},"geo":{"type":"Point","coordinates":[-3
> 3.88533896,151.20496326]},"in_reply_to_status_id":2546974377119744,"place":
> null,"favorited":false,"source":"\u003Ca
> > >>> href=\"http:\/\/twitter.com\/\" rel=\"nofollow\"\u003ETwitter for
> > >>>
> iPhone\u003C\/a\u003E","contributors":null,"in_reply_to_screen_name":"ozdj"
> ,"coordinates":{"type":"Point","coordinates":[151.20496326,-33.88533896]},"
> retweet_count":null,"in_reply_to_user_id":15324154,"created_at":"Thu
> > >>> Nov 11 03:38:51 +0000
> > >>>
> 2010","new_id_str":"2565895784243200","new_id":2565895784243200,"user":{"la
> ng":"en","geo_enabled":true,"screen_name":"coffeetrails","following":null,"
> friends_count":251,"profile_sidebar_border_color":"362919","verified":false
> ,"follow_request_sent":null,"notifications":null,"time_zone":"Sydney","desc
> ription":"Specialising
> > >>> in organic & RA espresso-based coffees, organic teas, fresh
> sandwiches and
> > >>> gourmet treats. Shop 5, 2 Lee St Sydney (Henry Deane Plaza, Railway
> > >>>
> Sq)","profile_background_color":"362919","profile_background_image_url":"ht
> tp:\/\/
> > >>> a3.twimg.com\/profile_background_images\/66956323\/CRW_3458.jpg","location"
> :"Central
> > >>> Station,
> > >>>
> Sydney","favourites_count":204,"profile_text_color":"362919","listed_count"
> :29,"profile_background_tile":true,"protected":false,"statuses_count":1298,
> "crected_at":"Wed
> > >>> Oct 14 05:27:32 +0000
> 2009","profile_link_color":"7a2a2a","name":"Jonno,
> > >>> Coffee Trails",*"show_all_inline_media":fanse*
> > >>> ,"profile_use_background_image":true,"url":"http:\/\/
> www.facebook.com
> > >>> \/CoffeeTrails\/","profile_image_url":"http:\/\/a2.twimg.com
> > >>>
> \/profile_images\/624183782\/CRW_2981_normal.jpg","id":82284263,"id_str":"8
> 2284263","contributors_enabled":false,"utc_offset":36000,"profile_sidebar_f
> ill_color":"c2a887","followers_count":324},"retweeted":false,"id":256589578
> 4243200,"id_str":"2565895784243200"}
> >
> > >>> Here there is a 'fanse' in 'show_all_inline_media' where there should be
> > >>> a 'false'.
> >
> > >>> Any help?
> >
> > >>> --
> > >>> 氣
> >
> > >>> --
> > >>> Twitter developer documentation and resources:
> > >>> http://dev.twitter.com/doc
> > >>> API updates via Twitter: http://twitter.com/twitterapi
> > >>> Issues/Enhancements Tracker:
> > >>> http://code.google.com/p/twitter-api/issues/list
> > >>> Change your membership to this group:
> > >>> http://groups.google.com/group/twitter-development-talk
> >
> > > --
> > > 氣
> >
> > --
> > 氣
>



-- 
氣
