Spark provides three main data structures and APIs for working with big data: RDDs, DataFrames, and Datasets. Learn about the differences between them and when it's best to use each.
MySQL tables fragment over time as DELETE and UPDATE queries are performed. In a large database, fragmentation can slow queries down significantly. Learn when to optimize tables and follow our tips on how to defragment them.
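As a rough illustration of deciding when a table is worth optimizing, the sketch below takes size figures of the kind exposed by the `DATA_LENGTH`, `INDEX_LENGTH`, and `DATA_FREE` columns of `information_schema.TABLES` and builds `OPTIMIZE TABLE` statements for tables whose free-space share exceeds a cutoff. The helper names, the ratio definition, and the 20% threshold are assumptions made for this example, not recommended values.

```python
def fragmentation_ratio(data_length, index_length, data_free):
    """Share of a table's allocated space reported as free (assumed metric)."""
    total = data_length + index_length + data_free
    return data_free / total if total else 0.0


def optimize_statements(table_stats, threshold=0.2):
    """Build OPTIMIZE TABLE statements for tables above the threshold.

    table_stats: iterable of
        (schema, table, data_length, index_length, data_free)
    threshold: illustrative cutoff only; tune it for your own workload.
    """
    statements = []
    for schema, table, data_length, index_length, data_free in table_stats:
        ratio = fragmentation_ratio(data_length, index_length, data_free)
        if ratio > threshold:
            statements.append(f"OPTIMIZE TABLE `{schema}`.`{table}`;")
    return statements
```

The statements are only generated here, not executed, so they can be reviewed before being run during a quiet period.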
You've normalized a database, but now it's too complicated to analyze the data. Database denormalization is commonly used in data analytics to simplify reporting. Learn about the various methods to denormalize a database and when to apply them.
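One common way to denormalize is to pre-join normalized tables into a single flat reporting table. The sketch below shows the idea using SQLite from the Python standard library as a stand-in database; the `customers`, `orders`, and `order_report` tables and the `revenue_by_city` helper are hypothetical names chosen for the example.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Normalized schema: customer details live in their own table.
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT);
    CREATE TABLE orders (
        id INTEGER PRIMARY KEY,
        customer_id INTEGER REFERENCES customers(id),
        total REAL
    );
    INSERT INTO customers VALUES (1, 'Ana', 'Lisbon'), (2, 'Ben', 'Oslo');
    INSERT INTO orders VALUES (10, 1, 25.0), (11, 1, 40.0), (12, 2, 15.0);

    -- Denormalized reporting table: customer columns are copied onto
    -- every order row so that reports need no join.
    CREATE TABLE order_report AS
    SELECT o.id AS order_id,
           c.name AS customer_name,
           c.city AS customer_city,
           o.total AS total
    FROM orders AS o
    JOIN customers AS c ON c.id = o.customer_id;
""")


def revenue_by_city(connection):
    """Aggregate revenue per city from the flat table, with no join."""
    return connection.execute(
        "SELECT customer_city, SUM(total) FROM order_report "
        "GROUP BY customer_city ORDER BY customer_city"
    ).fetchall()
```

The trade-off is redundancy: customer data is duplicated across rows, so the reporting table has to be rebuilt or refreshed whenever the source tables change.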