As usual, Brother Steve is seeing this at a much higher conceptual and
process level than I am.
From the perspective of Analytic Journalism, if we're dealing with a
large data set -- say 10K to 1 million records -- we would first draw
a sample of a small TK percent to develop and test our assumptions,
methods, and process. Once the process is stable, we run it against a
larger sample; if it's still stable, we throw it against the total dataset.
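A minimal sketch of that staged-sampling loop, assuming a pandas
DataFrame and a hypothetical validate() routine that stands in for
whatever assumptions/methods/process we're testing:

    import pandas as pd

    def staged_run(df: pd.DataFrame, validate, fractions=(0.01, 0.10, 1.0), seed=42):
        """Run `validate` on progressively larger samples; stop as soon as it breaks."""
        for frac in fractions:
            # Draw a reproducible sample, or use the full dataset on the final pass.
            sample = df if frac >= 1.0 else df.sample(frac=frac, random_state=seed)
            stable = validate(sample)
            print(f"fraction={frac:.0%}  n={len(sample)}  stable={stable}")
            if not stable:
                return False  # rework assumptions/methods before scaling up
        return True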
Your original post triggered a cascade of memories (some of which I
blurted out here), as well as a jam session with my
bar-friend-cum-technical-interlocutor GPT, who led me on a merry chase
through some latent techniques I once ideated on (some blurted here
earlier).
A phrase that came out of that tête-à-tête, one that fits what I think
you are describing from your own POV (highly relevant in these modern
times of 3M-record data dumps from the DOJ meant to baffle-with-BS), is
"pre-image". GPT offered me the more explicit denotation:
The pre-image is not “the” original data point, but: *the equivalence
class of upstream possibilities consistent with the downstream
observation.*
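As a toy illustration (the lossy map f and the candidate range below
are invented for the example, nothing GPT or I actually ran): the
pre-image of an observed downstream value is just the set of upstream
candidates that the map sends to it.

    def f(x: int) -> int:
        # A many-to-one "downstream" summary: keep only the tens digit.
        return x // 10

    def pre_image(y: int, candidates: range) -> set:
        """The equivalence class of upstream values consistent with observing y."""
        return {x for x in candidates if f(x) == y}

    print(pre_image(4, range(100)))  # {40, 41, ..., 49}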
I have a lot of respect for those of you who swim well in
high-dimensional, poorly defined, poorly conditioned data sets such
as "the news stream".