Topics with Label: SQL

Forum Posts

by RaghuMundru • New Contributor III

10-15-2015 7:11:03 AM

26919 Views
15 replies
0 kudos

Resolved! I am running a simple count and I am getting an error

Here is the error that I am getting when I run the following query: statement = sqlContext.sql("SELECT count(*) FROM ARDATA_2015_09_01").show() Py4JJavaError Traceback (most rec...

Data Engineering

Latest Reply

muchave
New Contributor II

02-16-2020 8:29:38 PM

0 kudos

192.168.o.1 is a private IP address used to login the admin panel of a router. 192.168.l.l is the host address to change default router settings.

by pepevo • New Contributor III

02-10-2020 7:23:36 AM

10095 Views
10 replies
0 kudos

Resolved! How to convert column type from decimal to date in sparksql

I need to convert a column from decimal to date in Spark SQL when the format is not yyyy-mm-dd. The table has a column declared as decimal(38,0) that holds dates in yyyymmdd format, and I am unable to run SQL queries on it in a Databricks notebook. ...
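
One possible approach, sketched under assumptions: cast the decimal to a string and parse it with the pattern `yyyyMMdd` using Spark SQL's `to_date`. The table name `my_table` and column name `dt` are hypothetical, and the query string assumes a Spark version whose `to_date` accepts a format argument (2.2 or later). The plain-Python helper mirrors the same logic so it can be checked without a cluster.

```python
from datetime import date, datetime
from decimal import Decimal

# Hypothetical table and column names; in a notebook this string
# would be passed to spark.sql(...).
SPARK_SQL = (
    "SELECT to_date(CAST(dt AS STRING), 'yyyyMMdd') AS dt_date "
    "FROM my_table"
)


def yyyymmdd_to_date(value: Decimal) -> date:
    """Local mirror of the SQL above: decimal(38,0) -> string -> date."""
    text = str(int(value))  # Decimal('20200210') -> '20200210'
    return datetime.strptime(text, "%Y%m%d").date()


print(yyyymmdd_to_date(Decimal("20200210")))  # 2020-02-10
```

Values that are not valid yyyymmdd dates raise `ValueError` in the helper; in Spark SQL the corresponding behaviour depends on the version and its parser settings.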

Data Engineering

Latest Reply

pepevo
New Contributor III

02-13-2020 11:35:35 AM

0 kudos

thank you Tom. I made it work already.

by User16301467532 • New Contributor II

07-15-2015 11:45:24 AM

16871 Views
9 replies
1 kudos

How can I change the parquet compression algorithm from gzip to something else?

Spark, by default, uses gzip to store parquet files. I would like to change the compression algorithm from gzip to snappy or lz4.

Data Engineering

Latest Reply

ZhenZeng
New Contributor II

10-01-2019 2:10:05 AM

1 kudos

spark.sql("set spark.sql.parquet.compression.codec=gzip");
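
The same setting can also be applied from Python through the session configuration. The helper below is a minimal sketch, not an official API: `set_parquet_codec` and `ASSUMED_CODECS` are hypothetical names, and the codec list is a conservative assumption, since the accepted values (for example whether `lz4` is allowed) depend on the Spark version.

```python
# Codec names assumed valid for spark.sql.parquet.compression.codec;
# the accepted set varies by Spark version, so treat this as a guess.
ASSUMED_CODECS = {"uncompressed", "snappy", "gzip", "lzo"}


def set_parquet_codec(spark, codec: str) -> str:
    """Set the Parquet codec on a SparkSession-like object.

    Returns the normalized codec name that was applied.
    """
    name = codec.lower()
    if name not in ASSUMED_CODECS:
        raise ValueError(f"unsupported codec: {codec!r}")
    spark.conf.set("spark.sql.parquet.compression.codec", name)
    return name
```

In a notebook this might be called as `set_parquet_codec(spark, "snappy")` before writing. For a single write, `df.write.option("compression", "snappy").parquet(path)` is an alternative that leaves the session default untouched.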

by MikeK_ • New Contributor II

11-29-2019 11:32:28 AM

13081 Views
1 reply
0 kudos

Resolved! SQL variables in a notebook

Hi, In an SQL notebook, using this link: https://docs.databricks.com/spark/latest/spark-sql/language-manual/set.html I managed to figure out how to set values and how to get the value. SET my_val=10; //saves the value 10 for key my_val SET my_val; //dis...

Data Engineering

Latest Reply

shyam_9
Valued Contributor

12-01-2019 11:38:37 PM

0 kudos

Hi @Mike K.., you can do this with widgets and getArgument. Here's a small example of what that might look like: https://community.databricks.com/s/feed/0D53f00001HKHZfCAP

by tripplehay777 • New Contributor

09-01-2016 12:41:37 AM

10459 Views
1 reply
0 kudos

How can I create a Table from a CSV file with first column with data in dictionary format (JSON like)?

I have a csv file with the first column containing data in dictionary form (keys: value). [see below] I tried to create a table by uploading the csv file directly to databricks but the file can't be read. Is there a way for me to flatten or conver...
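
If the first column holds valid JSON objects, one option is to flatten the file locally before uploading it. The sketch below uses only the Python standard library; it assumes the dictionary column parses as JSON, and the `col_1`, `col_2`, ... names for the remaining columns as well as the sample data are made up for illustration.

```python
import csv
import io
import json


def flatten_rows(csv_text: str):
    """Yield one flat dict per CSV row, expanding the JSON object in column 0."""
    for row in csv.reader(io.StringIO(csv_text)):
        record = json.loads(row[0])  # first column holds the JSON object
        # Keep the remaining columns under generated names col_1, col_2, ...
        for i, value in enumerate(row[1:], 1):
            record[f"col_{i}"] = value
        yield record


# Made-up sample: a quoted JSON object followed by one ordinary column.
sample = '"{""name"": ""a"", ""qty"": 2}",x\n'
print(list(flatten_rows(sample)))
```

The flattened records can then be written back out with `csv.DictWriter` and uploaded as an ordinary CSV.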

Data Engineering

Latest Reply

MaxStruever
New Contributor II

08-15-2019 12:37:19 PM

0 kudos

This is apparently a known issue. Databricks has its own CSV format handler that can handle this: https://github.com/databricks/spark-csv SQL API: the CSV data source for Spark can infer data types: CREATE TABLE cars USING com.databricks.spark.csv OP...

by martinch • New Contributor II

03-01-2019 7:40:53 AM

8411 Views
4 replies
0 kudos

DROP TABLE IF EXISTS does not work

When I try to run the command spark.sql("DROP TABLE IF EXISTS table_to_drop") and the table does not exist, I get the following error: AnalysisException: "Table or view 'table_to_drop' not found in database 'null';;\nDropTableCommand `table_to_drop...

Data Engineering

Latest Reply

StevenWilliams
New Contributor II

07-30-2019 5:46:30 AM

0 kudos

I agree about this being a usability bug. Documentation clearly states that if the optional flag "IF EXISTS" is provided, the statement will do nothing. https://docs.databricks.com/spark/latest/spark-sql/language-manual/drop-table.html Drop Table ...

by rishigc • New Contributor

04-25-2019 9:43:45 AM

12132 Views
1 reply
0 kudos

Split a row into multiple rows based on a column value in Spark SQL

Hi, I am trying to split a record in a table to 2 records based on a column value. Please refer to the sample below. The input table displays the 3 types of Product and their price. Notice that for a specific Product (row) only its corresponding col...

Data Engineering

Latest Reply

mathan_pillai
Valued Contributor

04-26-2019 3:31:30 AM

0 kudos

Hi @rishigc You can use something like below. Note that split takes a regular expression, so a literal plus sign is written as '[+]'. SELECT explode(arrays_zip(split(Product, '[+]'), split(Price, '[+]'))) as product_and_price from df or df.withColumn("product_and_price", explode(arrays_zip(split(Product, '[+]'), split(Price, '[+]')))).select( ...

by dan11 • New Contributor II

03-04-2016 8:46:20 PM

2384 Views
4 replies
1 kudos

sql delete?

Hello databricks people, I started working with databricks today. I have a sql script which I developed with sqlite3 on a laptop. I want to port the script to databricks. I started with two sql statements: select count(prop_id) from prop0; del...

Data Engineering

Latest Reply

Bill_Chambers
Contributor II

03-11-2016 9:57:05 AM

1 kudos

Hey Dan, good to hear you're getting started with Databricks. This is not a limitation of Databricks; it's a restriction built into Spark itself. Spark is not a data store; it's a distributed computation framework. Therefore deleting data would be un...

by Tamara • New Contributor III

11-03-2015 4:01:50 AM

8772 Views
8 replies
1 kudos

Resolved! Can I connect to a MS SQL server table in Databricks account?

I'd like to access a table on a MS SQL Server (Microsoft). Is it possible from Databricks? To my understanding, the syntax is something like this (in a SQL Notebook): CREATE TEMPORARY TABLE jdbcTable USING org.apache.spark.sql.jdbc OPTIONS ( url...
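
For comparison, the same kind of connection can be sketched from a Python notebook with Spark's JDBC reader. This is a hedged sketch: the helper names, host, and database are hypothetical, the default port 1433 is an assumption about the server, and the Microsoft SQL Server JDBC driver must be available on the cluster.

```python
def sqlserver_jdbc_url(host: str, database: str, port: int = 1433) -> str:
    """Build a URL of the form jdbc:sqlserver://host:port;databaseName=db."""
    return f"jdbc:sqlserver://{host}:{port};databaseName={database}"


def read_table(spark, url: str, table: str, user: str, password: str):
    """Load a table through Spark's JDBC reader (spark is a SparkSession)."""
    return (
        spark.read.format("jdbc")
        .option("url", url)
        .option("dbtable", table)
        .option("user", user)
        .option("password", password)
        .load()
    )


# Hypothetical host and database, for illustration only.
print(sqlserver_jdbc_url("myserver.example.com", "sales"))
```

In practice the credentials would come from a secret store rather than being typed into the notebook.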

Data Engineering

Latest Reply

JohnSmith091
New Contributor II

11-27-2018 1:19:31 AM

1 kudos

Thanks for the trick that you have shared with us. I am really amazed to use this informational post. If you are facing MacBook error like MacBook Pro won't turn on black screen then click the link.

by semihcandoken • New Contributor

08-18-2016 9:29:07 PM

13642 Views
4 replies
0 kudos

How to convert column type from str to date in sparksql when the format is not yyyy-mm-dd?

I imported a large csv file into databricks as a table. I am able to run sql queries on it in a databricks notebook. In my table, I have a column that contains date information in the mm/dd/yyyy format: 12/29/2015, 12/30/2015, etc. Databricks impo...

Data Engineering

Latest Reply

ShubhamGupta187
New Contributor II

04-19-2018 9:37:52 PM

0 kudos

@josephpconley would it be safe to cast a column that contains null values?
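
On the null question: in Spark SQL a NULL input to a cast or to `to_date` yields NULL rather than an error, so null rows pass through unchanged. The plain-Python sketch below mirrors that behaviour for the mm/dd/yyyy format; `parse_us_date` and the column name in the comment are hypothetical.

```python
from datetime import date, datetime
from typing import Optional


# Spark SQL counterpart (hypothetical column name):
#   to_date(date_str, 'MM/dd/yyyy')
def parse_us_date(text: Optional[str]) -> Optional[date]:
    """Parse mm/dd/yyyy; None in gives None out, mirroring NULL propagation."""
    if text is None:
        return None
    return datetime.strptime(text, "%m/%d/%Y").date()


print([parse_us_date(v) for v in ["12/29/2015", None]])
```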

by max522over • New Contributor II

06-09-2016 1:22:08 PM

12497 Views
3 replies
0 kudos

Resolved! I've set the partition mode to nonstrict in hive but spark is not seeing it

I've got a partitioned table I want to add some data to. I want to use dynamic partitioning, but I get this error: org.apache.spark.SparkException: Dynamic partition strict mode requires at least one static partition column. To turn this off ...
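
The accepted fix is not shown in this excerpt. One thing to try, as a sketch: Spark reads `hive.exec.dynamic.partition.mode` from its own session, so the setting has to be issued through Spark rather than in the Hive CLI. The helper name below is hypothetical; it works with any object that exposes a `sql` method, such as a `SparkSession` or a `HiveContext`.

```python
def enable_dynamic_partitioning(spark) -> None:
    """Issue the SET commands on the Spark session itself, not in Hive."""
    spark.sql("SET hive.exec.dynamic.partition=true")
    spark.sql("SET hive.exec.dynamic.partition.mode=nonstrict")
```

Call it once before running the `INSERT ... PARTITION` statement in the same session.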

Data Engineering

Latest Reply

max522over
New Contributor II

06-13-2016 3:53:56 PM

0 kudos

I got it working. This was exactly what I needed. Thank you @Peyman Mohajerian

by dan11 • New Contributor II

03-07-2016 3:05:49 PM

3284 Views
1 reply
1 kudos

sql: how to convert datatype of column?

Bricklayers, I want to port this sql statement from sqlite to databricks: select cast(myage as number) as my_integer_age from ages; Does databricks allow me to do something like this?

Data Engineering

Latest Reply

raela
New Contributor III

03-08-2016 11:21:03 AM

1 kudos

@dan11 We don't support number in Spark SQL. Try using int, double, float, and your query should be fine. To run SQL in a notebook, just prepend any cell with %sql. %sql select cast(myage as double) as my_integer_age from ages;

by Anonymous • Not applicable

04-22-2015 9:24:42 AM

10108 Views
2 replies
0 kudos

How can I use display() in a python notebook with pyspark.sql.Row Objects, e.g. after calling the first() operation on a DataFrame?

I'm trying to display() the results from calling first() on a DataFrame, but display() doesn't work with pyspark.sql.Row objects. How can I display this result?

Data Engineering

Latest Reply

dnchari
New Contributor II

11-18-2015 3:22:36 PM

0 kudos

Use take()
