Server manufacturers ramp up edge AI efforts

There has been a spate of developments in the server space, as manufacturers focus on supporting inference workloads at the edge

Cliff Saran, Managing Editor

Published: 13 Nov 2024 10:45

Server manufacturers have long recognised the niche in public cloud computing that physical servers neatly fill. Over time, IT leaders and the industry have come to recognise that some workloads will always run on-premise; some may run both in the public cloud and on-premise; and some may be wholly cloud-based.

Artificial intelligence (AI) inference is the workload that’s now gaining traction among the server providers, as they look to address concerns over data loss, data sovereignty and potential latency issues when crunching AI data from edge devices and the internet of things (IoT).

Dell Technologies has now extended its Dell NativeEdge operations software platform to simplify how organisations deploy, scale and use AI at the edge.

The Dell platform offers what the company describes as “device onboarding at scale”, remote management and multi-cloud application orchestration. According to Dell, NativeEdge offers high-availability capabilities to maintain critical business processes and edge AI workloads, which can continue to run irrespective of network disruptions or device failures. The platform also offers virtual machine (VM) migration and automatic application, compute and storage failover, which, said Dell, provides organisations with increased reliability and continuous operations.

One of its customers, Nature Fresh Farms, is using the platform to manage over 1,000 IoT-enabled facilities. “Dell NativeEdge helps us monitor real-time infrastructure elements, ensuring optimal conditions for our produce, and receive comprehensive insights into our produce packaging operations,” said Keith Bradley, Nature Fresh Farms’ vice-president of information technology.

Coinciding with the KubeCon North America 2024 conference, Nutanix announced its support for hybrid and multi-cloud AI based on the new Nutanix Enterprise AI (NAI) platform. This can be deployed on any Kubernetes platform, at the edge, in core datacentres and on public cloud services.

Nutanix said NAI delivers a consistent hybrid multi-cloud operating model for accelerated AI workloads, helping organisations securely deploy, run and scale inference endpoints for large language models (LLMs) to support the deployment of generative AI (GenAI) applications in minutes, not days or weeks.

Edge AI with the hyperscalers

The public cloud platforms all offer feature-rich environments for GenAI, machine learning and running inference workloads. They also have product offerings to cater for AI inference on IoT and edge computing devices.

Amazon Web Services offers SageMaker Edge Agent; Azure IoT Hub is part of the mix Microsoft offers; and Google has Google Distributed Cloud. Such offerings generally focus on doing the heavy lifting, namely machine learning, using the resources available in their respective public clouds to build data models. These are then deployed to power inference workloads at the edge.

What appears to be happening with the traditional server companies is that, in response to the cloud AI threat, they see a number of opportunities. The first is that IT departments will continue to buy and deploy on-premise workloads, and AI at the edge is one such area of interest. The second factor likely to influence IT buyers is the availability of blueprints and templates to help them achieve their enterprise AI goals.

According to analyst Gartner, while the public cloud providers have been very good at showing the art of the possible with AI and GenAI, they have not been particularly good at helping organisations achieve their AI objectives.

Speaking at the recent Gartner Symposium, Daryl Plummer, chief research analyst at Gartner, warned that tech providers are too focused on looking at the advancement of AI from their perspective, without taking customers on the journey to achieve the objectives of these advanced AI systems. “Microsoft, Google, Amazon, Oracle, Meta and OpenAI have made one major mistake – they’re showing us what we can do, [but] they’re not showing us what we should do,” he said.

The missing pieces concern domain expertise and IT products and services that can be tailored to a customer’s unique requirements. This certainly looks like the area of focus the likes of Dell, HPE and Lenovo will look to grow in partnership with IT consulting firms.
