I use AWS Athena to query some data stored in S3, namely partitioned parquet files with pyarrow compression.
I have three columns with string values, one column called "key" with int values and one column called "result" which have both double and int values.
With those columns, I created Schema like:
create external table (
key int,
result double,
location string,
vehicle_name string.
filename string
)
Run Code Online (Sandbox Code Playgroud)
When I queried the table, I would get
HIVE_BAD_DATA: Field results type INT64 in …