โ01-27-2024 08:38 AM - edited โ01-27-2024 09:04 AM
I have a JavRDD with complex nested xml content that I want to unmarshall using JAXB and get the data in to java objects. Can anyone please help with how can I achieve?
Thanks
โ01-27-2024 10:08 AM
I hope this should work
JavaPairRDD<String, PortableDataStream> jrdd = javaSparkContext.binaryFiles("<path_to_file>");Map<String, PortableDataStream> mp = jrdd.collectAsMap();OutputStream os = new FileOutputStream(f);mp.values().forEach(pd -> { try { os.write(pd.toArray()); } catch (IOException e) { throw new RuntimeException(e); }});os.flush();And then supplying file to jaxb unmarshaller. Not sure if there is a better way.
never-displayed