ORC (Optimized Row Columnar)

Optimized Row Columnar (ORC) is a columnar file format for storing large datasets efficiently in big data systems. Rather than writing each record as a contiguous row, ORC groups rows into large blocks called stripes and, within each stripe, stores the values of each column together. Because the values in a column share a type and are often similar, they compress well, and analytical queries that touch only a few columns can skip the rest of the data. As a result, ORC can speed up such queries and reduce storage costs, which makes it easier for businesses to analyze large amounts of data. It was originally developed for Apache Hive and is also supported by data processing frameworks such as Apache Spark.
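
The idea behind a column-oriented layout can be illustrated with a small Python sketch. This is a toy model and not the ORC format itself: the sample records and the helper names `to_columns` and `run_length_encode` are assumptions made up for the example. It only shows why grouping values by column helps both column-selective reads and compression.

```python
from itertools import groupby

# Hypothetical sample records: (order_id, country, amount)
rows = [
    (1, "DE", 10.0),
    (2, "DE", 12.5),
    (3, "DE", 7.5),
    (4, "US", 20.0),
    (5, "US", 5.0),
]


def to_columns(rows, names):
    """Pivot row tuples into one list of values per column name."""
    return {name: [row[i] for row in rows] for i, name in enumerate(names)}


def run_length_encode(values):
    """Collapse runs of equal adjacent values into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


columns = to_columns(rows, ["order_id", "country", "amount"])

# An analytical query such as SUM(amount) only needs one column,
# so the other columns never have to be read.
total_amount = sum(columns["amount"])

# Values within a column are often repetitive, so a simple encoding
# such as run-length encoding shrinks them considerably.
encoded_country = run_length_encode(columns["country"])

print(total_amount)
print(encoded_country)
```

In this sketch the five `country` values collapse into two (value, count) pairs, and the sum is computed without touching `order_id` or `country`. Real ORC files apply the same principles at much larger scale, with additional encodings and general-purpose compression on top.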