Vast Data Platform aims at storage everywhere for AI/ML workloads

Vast Data to offer storage with data lake and warehouse functionality built in natively, in anticipation of a huge surge in AI/ML workloads and a need for ever-larger data stores

Antony Adshead, Computer Weekly

Published: 01 Aug 2023 17:00

Vast Data aims to get vaster with the launch of the Vast Data Platform that will provide customers with AI/ML-focused storage infrastructure that is intended to replace existing database, data lake and data warehouse functionality.

In terms of scale and outcomes, it aims at “global learning” and “constant realisation” from the AI systems it will support.

The Vast Data Platform marries its QLC flash-and-fast-cache storage subsystems with database-like capabilities at native storage input/output (I/O) level, plus AI compute functionality focused on continual learning and the ability to link numerous instances into a global grid. It comprises:

Vast Data Store: The company’s storage subsystem range, which has been shipping for some years now. This is built on high-density QLC flash that runs to enclosures of PB capacity, with storage-class memory (SCM) in what it calls “write-shaping”. Here, the SCM handles reads and writes to bulk storage in 1GB stripes to guarantee a 10-year lifespan for its QLC drives.

Vast DataBase: This brings database-like functionality to the fundamentals of how data is stored in Vast. As described by technical sales and marketing lead Jeff Denworth, this “next generation database” adds a tabular access method, with metadata describing how blocks of data on storage media are organised into files, objects and tables. According to Vast, this allows for rapid ingest of data as well as large volumes of query requests.

Vast Data Engine: A containerised, Python-based layer that brings AI processing on top of storage functionality. Here, the engine stores and handles functions and triggers that can, for example, bring the ability to rewrite queries based on the results of existing AI processing. In this, Vast wants to provide a platform that can provide AI learning that acts on the basis of what it had already learned.

Vast Data Space: The extensive geographical grid that brings a customer’s Data Platform instances together in a single namespace across on-premise and all the big three (AWS, Azure and GCP) public clouds. The idea is to create a mesh of computational resources (CPUs, GPUs and DPUs) that can move the data to compute or compute to data according to the gravity of either.

Vast Data Platform aims at storage everywhere for AI/ML workloads

Vast Data to offer storage with data lake and warehouse functionality built in natively, in anticipation of a huge surge in AI/ML workloads and a need for ever-larger data stores

Read more on unstructured data storage

Read more on Artificial intelligence, automation and robotics

Vast Data launches into AI stratosphere with AgentEngine

Vast Data sets sights on analytics, AI

Vast Data looks beyond storage to the data lakehouse

Pure Storage aims FlashBlade//E at unstructured data capacity