Re: Dictionary with repeated values?

2018-12-19 Thread Wes McKinney
The way that dictionary encoding is implemented in C++ (with DictionaryType, DictionaryArray) is a construct particular to the library. At the protocol level, dictionary encoding is a property of field at some level of a schema tree [1]. The dictionary itself is a record batch with a single field

Dictionary with repeated values?

2018-12-19 Thread Ben Kietzman
Is it legal to create a DictionaryType whose dictionary has repeated values?