top of page
Image by Markus Spiske

OUR TECHNOLOGY

Windjammer accelerates spark queries by 3x while reducing infrastructure cost to one quarter. We offer seamless integration with AWS-EMR and GCP-DataProc and 100% SPARK compatibility. Install Windjammer without interrupting your workflow. We're eager to benchmark our engine against any of our competitor's, apples-to-apples. Our devoted team of engineers make it easy to get answers to all your technical questions immediately.

Untitled design (1).png

Windjammer Accelerator provides industry leadership in Cloudstore Analytics and ML

general leadership.png

 Why Spark Acceleration?

  • Massive scale of use of Spark

  • Spark’s JVM is very CPU intensive causing server sprawl, performance instability & management challenges

  • Spark does not fully exploit high bandwidth of today’s cloud storage systems, causing high query run times

  • Standard Spark fault tolerance requires persisting data at shuffle boundaries, motivating complex and expensive shuffle services

Screen Shot 2023-08-16 at 8.51.59 AM.png

Windjammer Spark Accelerator

  • More efficient use of expensive CPU resources:

    • Native execution

    • MPP (massively parallel processing)  

    • Dataflow clustered architecture eliminating JVM bottlenecks

  • Fully exploits S3/GCS cloud storage bandwidth

  • Aggressive parallel, asynchronous prefetch of analytics data sets

  • Eliminates need for complex shuffle service:

    • Checkpoint-based fault-tolerance uses reliable, high bandwidth S3 cloud storage: no need for special shuffle service while providing full query fault tolerance including spot instance and cluster interruptions

  • Transparent, 100% compatible

Windjammer cuts cluster size to a quarter while accelerating performance

emr6.8+WJ scaling.png

Windjammer cuts customer costs to a third for Spark cloud deployments

tco (2).png

Windjammer’s slashes CPU to a quarter or more

emr6.8cpu.png

Windjammer boosts spark cloud store utilization by 3x

emr6.8bandwidth.png

If you are interested in...

  • 3x-5x increase in query throughput/vCPU to eliminate server sprawl

  • Making full use of S3/GCS  cloud storage bandwidth to speed up your Spark jobs

  • Eliminating the cost and hassle of a shuffle service  

  • Doing all this with a simple bootstrap extension in EMR cluster creation

  • Then you will be interested in deploying EMR and Dataproc with Windjammer’s Spark Accelerator

  • We look forward to working with you!

Patents

Facebook_logo_(square).png
768px-LinkedIn_logo_initials.png
bottom of page