RT @jaceklaskowski
Trying to get the better grip over aggregation execution in #ApacheSpark #SparkSQL and wonder what to google for to learn how to describe the topic in a more academic style.
Used "introduction aggregation" with and without "spark" and found some resources.
Any other recs? 🙏
Spark question:
Say you have a String field, the string represents a json (assuming a valid json) - how would you select rows that have more than 1 json?
Example: most rows have [{json}] , but some rows have [{json1},{json2}] -how can we get these rows using Spark sql?
#spark #sparksql