Intel Shanghai's performance optimization engineers, Jiebin Sun, Zhiguo Zhou, Wangyang Guo, and Tianyou Li, have been focusing on enhancing the scalability of ClickHouse on Intel's ultra-high core count processors. As modern processors, like the Granite Rapids and Sierra Forest, push core counts to unprecedented levels, databases like ClickHouse face challenges in fully utilizing these cores due to bottlenecks such as lock contention, cache coherence, and memory management. The engineers employed various profiling tools to analyze ClickHouse's performance, leading to significant optimizations. These include reducing lock contention by rethinking thread coordination, optimizing memory management with improved allocator configurations, increasing parallelism in hash table merging, leveraging SIMD instructions for more efficient string searches, and addressing false sharing by aligning data structures with cache lines. These optimizations have resulted in substantial speedups for specific ClickBench queries, demonstrating that ClickHouse can effectively scale on high core count systems, a crucial capability as processors continue to evolve towards even higher core counts.