Converting SPDE to parquet via sd2pq #624
-
Hi, I'm looking to convert some very large legacy SPDE data to parquet. I saw sd2pq, but I was not able to locate entries in the documentation and examples on how to connect with the SPDE data.
Thank you in advance!
-
Yes, in SAS you submit a libname statement to connect to a data source, be it SAS data sets, SPDS/SPDE data, DBMS data ...
Then you have a libref, which references that data, and then the table names are the names of the data sets/tables.
Here's the doc for libname SPDE
Then, hopefully the sd2pq() method, which was user contributed, will work for you!
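For anyone landing here later, the libname-then-sd2pq flow described above might look roughly like the sketch below. The libref `legacy`, the SPDE path, the table name, and the output file are all placeholders, not values from this thread. `saslib()` and `sd2pq()` are used as I understand them from the saspy docs, so check the signatures against your installed version.

```python
def spde_to_parquet(sas, spde_path, table, out_file, libref="legacy"):
    """Assign an SPDE libref on an open saspy session, then export one table.

    `sas` is an existing saspy.SASsession; every other value is a placeholder.
    """
    # Roughly equivalent to submitting: libname legacy spde '<spde_path>';
    sas.saslib(libref, engine="spde", path=spde_path)

    # The table name is just the data set name inside that libref.
    sas.sd2pq(parquet_file_path=out_file, table=table, libref=libref)
    return out_file


# Typical use (requires a working SAS connection):
#   import saspy
#   sas = saspy.SASsession()
#   spde_to_parquet(sas, "/data/spde/meta", "big_table", "big_table.parquet")
```

The session is passed in rather than created inside the function so the same connection can be reused for many tables.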
-
Thank you! Will give it a shot here in a moment. Unrelated, the link appears dead for me.
-
Don't know why that would be. Just google
-
Have had to delay the implementation due to a few things so will update you as soon as I'm able. Thank you for your help with this.
-
Is there any way to get a quick "peek" at the generated parquet file?
I suspect I'm calling the wrong table and what I'm grabbing is empty, but I can't seem to be certain.
-
So this worked quite swimmingly. Thank you so much!
That said, could you recommend a way to subset the rows being pulled? I've tried a few suggested approaches and they usually result in bad calls. If nothing immediately comes to mind, happy to go ahead and resolve and close the question.
-
Here's an example of just providing dsopts on the sd2df call so you can easily subset rows. You can set it on the SASdata object and change it between calls too. There's more than one way, depending on whether you want it associated with the SASdata object for all of the methods you use, or just provided on a method for one-time use. FYI, the cars data set has 428 rows. obs= is a bit of a misnomer; it should have been called lastobs=.
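A minimal sketch of the per-call approach, assuming an open saspy session is passed in as `sas`. The helper names `row_window` and `fetch_rows` are made up for illustration; `firstobs` and `obs` are the dsopts keys as I understand them from the saspy docs.

```python
def row_window(first=1, last=None):
    """Build a dsopts dict selecting observations first..last (1-based, inclusive)."""
    opts = {}
    if first > 1:
        opts["firstobs"] = first
    if last is not None:
        opts["obs"] = last  # acts as "last observation", not a row count
    return opts


def fetch_rows(sas, table, libref, first=1, last=None):
    # dsopts given on the call applies to this one transfer only.
    return sas.sd2df(table, libref, dsopts=row_window(first, last))


# e.g. fetch_rows(sas, "cars", "sashelp", first=11, last=20) would request
# rows 11 through 20 of sashelp.cars: 10 rows, not 20.
```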
-
As a comparison, setting dsopts on the SASdata object (obs 1-10) applies it to all of that object's methods.
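The object-level variant could be sketched like this, again assuming `sas` is an open saspy session. The function name is hypothetical, and reassigning `data.dsopts` relies on the "change it between calls" behavior described above.

```python
def first_rows_via_sasdata(sas, table, libref, n=10):
    """Attach dsopts to a SASdata object so every method on it sees rows 1..n."""
    # dsopts must be passed by keyword; the third positional argument is results.
    data = sas.sasdata(table, libref, dsopts={"obs": n})
    df = data.to_df()  # limited to the first n rows

    # The options live on the object and can be changed between calls.
    data.dsopts = {"firstobs": n + 1, "obs": 2 * n}
    next_df = data.to_df()  # now rows n+1 through 2n
    return df, next_df
```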
-
Thanks for clarifying that!
-
Hi @tomweber-sas. Re-opening this again if I may. I seem to be getting an error with "failed" chunks for some of the sas9's I'm trying to convert. I created the following workflow to do it:
And usually pass it in the following manner via bash commands:
It's worked for several of our datasets. However, for a particularly large one, it keeps throwing some form of "chunk" failure that gives me the impression the chunk size is too big or too small in some way at a certain point.
I initially thought I was just running the same scripts by accident, but it's been occurring far too frequently even with the extra checks. environment.yml is below if you require it:
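The original workflow script did not survive in this copy of the thread. As a stand-in, here is a hypothetical command-line wrapper that makes the chunk size adjustable per run, which may help narrow down whether the failures depend on it. The flag names are invented. `chunk_size_mb` is, as far as I know, a keyword accepted by `sd2pq()`, but verify that against your saspy version. The sketch also assumes a failed conversion surfaces as a Python exception, which may not hold.

```python
import argparse


def parse_args(argv=None):
    p = argparse.ArgumentParser(description="Export one SAS table to parquet.")
    p.add_argument("--libref", required=True)
    p.add_argument("--table", required=True)
    p.add_argument("--out", required=True)
    # Left unset by default so saspy's own default chunk size is used.
    p.add_argument("--chunk-size-mb", type=int, default=None)
    return p.parse_args(argv)


def export_table(sas, args):
    """Run sd2pq once; return True on success, False if it raised."""
    kwargs = {}
    if args.chunk_size_mb is not None:
        kwargs["chunk_size_mb"] = args.chunk_size_mb
    try:
        sas.sd2pq(parquet_file_path=args.out, table=args.table,
                  libref=args.libref, **kwargs)
        return True
    except Exception as exc:
        print(f"{args.libref}.{args.table} failed ({kwargs or 'defaults'}): {exc}")
        return False
```

Running the same table with two or three different `--chunk-size-mb` values and comparing where it fails would at least show whether the setting is involved.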