by
KVNARK
• Honored Contributor II
- 2611 Views
- 1 replies
- 5 kudos
trying to access the SSAS POIWER BI dataset using MDX query from python LInux server. We are hitting roadblock. The existing setup works as expected in windows system due to adodb.dll but unable to connect in Linux. Any help would be much appreciated...
- 2611 Views
- 1 replies
- 5 kudos
Latest Reply
@KVNARK . :One potential solution would be to use an open-source MDX library for Python that can connect to SSAS, such as OLAP-XMLA for Python. This library can be used to execute MDX queries against a SSAS server, including Power BI datasets.Here's...
- 2180 Views
- 0 replies
- 0 kudos
I'm working on a task where I transform a dataset and re-save it to an S3 bucket. This involves joining the dataset to two others, dropping fields from the initial dataset which overlapped with fields from the other two, hashing certain fields with p...
- 2180 Views
- 0 replies
- 0 kudos
by
Niha1
• New Contributor III
- 1253 Views
- 0 replies
- 1 kudos
file_path = f"{datasets_dir}/airbnb/sf-listings/sf-listings-2019-03-06-clean.parquet/"2airbnb_df = spark.read.format("parquet").load(file_path)34display(airbnb_df)AnalysisException: Path does not exist: dbfs:/user/nniha9188@gmail.com/dbacademy/machi...
- 1253 Views
- 0 replies
- 1 kudos
- 1094 Views
- 0 replies
- 1 kudos
I converted a data job fro RDD to Dataset, and I've found that, in prod, the data job runs faster, which is nice.But unit test runs 3x slower than before.My best guess is that Dataset spends time doing a lot of stuffs like encoding, optimizing, query...
- 1094 Views
- 0 replies
- 1 kudos
- 4578 Views
- 4 replies
- 3 kudos
Can anyone tell me how I can access the customer_t1 dataset that is referenced in the book "Delta Lake - The Definitive Guide "? I am trying to follow along with one of the examples.
- 4578 Views
- 4 replies
- 3 kudos
Latest Reply
Some files are visualized here https://github.com/vinijaiswal/delta_time_travel/blob/main/Delta%20Time%20Travel.ipynb but it is quite strange that there is no source in repository. I think only one way is to write to Vini Jaiswal on github.
3 More Replies
by
Geeya
• New Contributor II
- 1819 Views
- 1 replies
- 0 kudos
The process for me to build model is:filter dataset and split into two datasetsfit model based on two datasets union two datasetsrepeat 1-3 stepsThe problem is that after several iterations, the model fitting time becomes longer dramatically, and the...
- 1819 Views
- 1 replies
- 0 kudos
Latest Reply
I assume that you are using PySpark to train a model? It sounds like you are collecting data on the driver and likely need to increase the size. Can you share any code?
- 8370 Views
- 5 replies
- 0 kudos
Getting the error when try to load the uploaded file in py notebook.# File location and type file_location = "//FileStore/tables/data/d1.csv" file_type = "csv" # CSV options infer_schema = "true" first_row_is_header = "false" delimiter = ","# The app...
- 8370 Views
- 5 replies
- 0 kudos
Latest Reply
@naughtonelad if your issue is solved,please let me know as I am facing the same problem
4 More Replies