I read online that in PyArrow a string column would have a column-level size limit of 2GB. However, in my work I noticed this doesn't hold. def some_function( self, raw_table: pa.Table, ): schema = ...
Then observe the data explorer: What did you expect to happen? In this example, all the columns should be boolean. However, only columns 1 and 3 are (the ones with None values) are marked boolean in ...