Create large UTF8 variable-length string type.
This data type may not be supported by all Arrow implementations. Unless you need to represent data larger than 2GB, you should prefer string().
Create an instance of large UTF8 variable-length binary type:
>>> import pyarrow as pa >>> pa.large_string() DataType(large_string)
and use the type to create an array:
>>> pa.array(['foo', 'bar'] * 50, type=pa.large_string()) <pyarrow.lib.LargeStringArray object at ...> [ "foo", "bar", ... "foo", "bar" ]