
Elijah Stone Sun, 11 Sep 2022 22:10:34 -0700

It would be helpful to know what code you were using that was not working on symbols and boxes. One quick solution is x ({.,#)/. i.#x; that gives a two-column table whose first column gives the index of the first occurrence of each distinct item and whose second column gives the number of occurrences of that item. Faster is to compute #/.~x and I.@:~:x separately, building the index and length lists independently.
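As a minimal sketch of the two approaches just described, assuming a small list of boxed words as input (the sample sentence and the names w, tbl, firstidx and counts are made up for illustration):

```j
NB. Hypothetical sample data: a list of boxed words.
w =: ;: 'the cat sat and the cat ran and the cat sat'

NB. Key groups the indices i.#w by the items of w; each group yields
NB. its first index and its size, so the result is a two-column table.
tbl =: w ({. , #)/. i. # w

NB. The same information built as two separate lists.
firstidx =: I. ~: w     NB. index of the first occurrence of each distinct item
counts   =: #/.~ w      NB. number of occurrences of each distinct item
```

Here tbl should match firstidx ,. counts. Nothing in these expressions depends on the items being numeric, so the same phrases ought to apply to symbols or to the rows of a rank-2 array.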

Assuming you do not care about the content of the words, you may find it convenient to create a dictionary of words encountered, and then represent each word with its index in the dictionary.
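One way this dictionary idea might look, as a sketch only (the sample sentence and the names w, dict, ids, tri and tricnt are illustrative assumptions, not taken from the original question):

```j
NB. Hypothetical sample data: a list of boxed words.
w =: ;: 'the cat sat and the cat ran and the cat sat'

dict =: ~. w            NB. dictionary: each distinct word once, in order of first appearance
ids  =: dict i. w       NB. each word replaced by its index in the dictionary

NB. Overlapping 3-tuples of word indices, one tuple per row of an integer table.
tri    =: 3 ]\ ids
tricnt =: #/.~ tri      NB. occurrences of each distinct tuple, in the order of ~. tri
```

Working on integer indices keeps the tuples in a plain numeric table, and ids { dict recovers the original words when they are needed again.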

> +/|:= and tying that to the nub somehow

That should work. I would use +/"1=y rather than +/|:=y. The result is a frequency count for each element of the nub; no extra mapping required.
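A small sketch of the self-classify approach, assuming boxed words and, for comparison, the same words as a rank-2 character table (the sample sentence and the names w, m, c1, c2, c3 and report are illustrative):

```j
NB. Hypothetical sample data: boxed words, and the same words as a character table.
w =: ;: 'the cat sat and the cat ran and the cat sat'
m =: > w

c1 =: +/"1 = w          NB. row sums of the self-classification table
c2 =: #/.~ w            NB. the same counts via key
c3 =: +/"1 = m          NB. works on the rows of a rank-2 array as well

report =: (~. w) ,. <"0 c1   NB. each distinct word paired with its count
```

Note that = w builds a boolean table with one row per distinct item and one column per item of w, so for long inputs the key form is likely to use far less memory.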


 -E

On Mon, 12 Sep 2022, 'Viktor Grigorov' via Programming wrote:

Hey,
Whilst getting back to a Markov text generator in J, I quickly came to the issue of all top-result verbs for histograms, found querying the wiki and the mailing lists, to be dealing with numeric types only. One'd have to reshape the items appropriately to a new dimension. But if one has n-tuples of words, what then? The verbs give domain errors on symbols and boxes.
Obviously, I. can't be used, and something long would just be ill-performant, like +/|:= and tying that to the nub somehow. What would work with symbols, boxed strings, or rank 2 array? What would work well? A typical novel is 8e4 +- 2e4, that's a lot of tuples, and my idea is to employ 3-, 4-, and 5-length groupings, just to compare results with different settings. I've dealt little-to-not-at-all with symbols, and boxed strings,
----------------------------------------------------------------------
For information about J forums see http://www.jsoftware.com/forums.htm


Re: [Jprogramming] type-independent histogram/occurence count/frequency?
