FileAlreadyExistsException error while analyzing table in Notebook
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-08-2024 03:13 PM
Databricks experts,
I'm new to Databricks, and encounter an issue with the ANALYZE TABLE command in the Notebook.
I created two tables nyc_taxi and nyc_taxi2, from one csv file.
When executing the following command in Notebook,
analyze table nyc_taxi2 compute statistics for columns passenger_count;
CREATE TABLE nyc_taxi2
(vendor_id String,
pickup_datetime timestamp,
dropoff_datetime timestamp,
passenger_count int,
trip_distance double,
pickup_longitude double,
pickup_latitude double,
rate_code int,
store_and_fwd_flag string,
dropoff_longitude double,
dropoff_latitude double,
payment_type string,
fare_amount double,
surcharge double,
mta_tax double,
tip_amount double,
tolls_amount double,
total_amount double)
USING CSV
OPTIONS("path"="/users/myfolder/nyc_taxi.csv","header" = "true");
Can anyone direct what could be the reason for this problem?
Thanks for the help!
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
02-09-2024 09:35 AM
Hi @Retired_mod , thank you for your reply!
I realized that another main difference between nyc_taxi and nyc_taxi2 is that nyc_taxi created using the UI, is a managed table, whereas nyc_taxi2 created using the SQL command is an external table. The locations are also different, nyc_taxi is stored under "dbfs:/user/hive/warehouse/myschema.db/nyc_taxi"; nyc_taxi2 is stored under "

