Jon Gold
Senior Writer

Cloudflare wants you to build AI applications on its edge network

News
Sep 27, 2023
Artificial Intelligence | Edge Computing

AI running in serverless configurations on GPUs is coming to Cloudflare’s global network.


Content delivery network (CDN), security and web services company Cloudflare is opening its worldwide network to companies looking to build and deploy AI models with new serverless AI, database and observability features, working with several new tech partners to do so.

Part one of Cloudflare’s new AI-focused initiative, announced today, is the Workers AI framework, which offers access to GPUs across Cloudflare’s network as a serverless way to run AI models. For users running highly latency-sensitive AI systems, the framework should offer the option of executing workloads much closer to the network edge, reducing round-trip time. The company said that Workers AI is also designed to keep inference separate from training data, ensuring that consumer information is not misused.

The second of Cloudflare’s new AI announcements is Vectorize, a vector database designed to let developers build AI-based applications entirely on Cloudflare’s own systems. Vectorize works in tandem with Cloudflare’s underlying network, again allowing work to be done closer to the end user, and integrates with Workers AI, so users can generate embeddings in Workers AI and index them in Vectorize.
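As a rough illustration of that workflow, the sketch below shows a Cloudflare Worker that generates an embedding with Workers AI and indexes it in Vectorize. The binding names (`AI`, `VECTORIZE`) and the embedding model identifier are illustrative assumptions, not details confirmed in the announcement.

```javascript
// Hypothetical Worker sketch: embed incoming text with Workers AI, then
// index the vector in Vectorize. Binding names and model ID are assumptions.
const worker = {
  async fetch(request, env) {
    const { id, text } = await request.json();

    // Generate an embedding for the incoming text with Workers AI.
    const { data } = await env.AI.run("@cf/baai/bge-base-en-v1.5", {
      text: [text],
    });

    // Index the resulting vector in Vectorize for later similarity search.
    await env.VECTORIZE.insert([{ id, values: data[0] }]);

    return new Response(JSON.stringify({ indexed: id }), {
      headers: { "content-type": "application/json" },
    });
  },
};

export default worker;
```

Because both steps run inside one Worker, the embedding never leaves Cloudflare’s network before it is indexed, which is the "entirely on Cloudflare's own systems" pattern the company describes.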

The third piece, AI Gateway, is a performance-management and optimization system designed to offer observability into AI applications running on Cloudflare’s network. According to the company, AI Gateway provides metrics such as the number and duration of requests, the cost of running an application, and user counts, as well as cost-saving options like rate limiting and caching answers to common queries.
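The gateway model generally works by proxying provider calls through a Cloudflare-hosted endpoint, which is where logging, caching, and rate limiting can happen. The sketch below routes an OpenAI-style chat request through such an endpoint; the account tag, gateway name, and exact URL scheme are placeholders assumed for illustration, not confirmed details from the announcement.

```javascript
// Illustrative sketch: call an upstream AI provider via an AI Gateway
// endpoint so requests can be observed, cached, and rate limited.
// ACCOUNT_TAG and "my-gateway" are placeholders.
const GATEWAY_URL =
  "https://gateway.ai.cloudflare.com/v1/ACCOUNT_TAG/my-gateway/openai/chat/completions";

async function askViaGateway(apiKey, prompt) {
  // The gateway forwards the request upstream; repeated identical prompts
  // can then be answered from the gateway's cache instead of the provider.
  const res = await fetch(GATEWAY_URL, {
    method: "POST",
    headers: {
      authorization: `Bearer ${apiKey}`,
      "content-type": "application/json",
    },
    body: JSON.stringify({
      model: "gpt-3.5-turbo",
      messages: [{ role: "user", content: prompt }],
    }),
  });
  return res.json();
}
```

The application-side change is minimal: only the base URL moves from the provider to the gateway, which is what makes per-request metrics and caching possible without touching the model code.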

Cloudflare also announced collaborations with Microsoft, Databricks and AI startup Hugging Face. Microsoft brings its ONNX runtime for continuity of AI models across cloud, edge or on-device usage, while the Databricks partnership adds that company’s MLflow open source platform for machine learning cycle management. Finally, Cloudflare’s network will be the first venue for customers to deploy Hugging Face’s powerful generative AI models in a serverless, GPU-powered environment.

Matthew Prince, co-founder and CEO of Cloudflare, said that the new offerings represent a major extension of the company’s developer platform, and that the goal is to make “inference infrastructure” accessible for all potential customers.

The network, according to Prince, is the best place to run AI.

“We’ve already seen interest from companies who are trying to solve this exact challenge of providing powerful experiences without sacrificing battery life or latency,” he said. “That said, as LLMs and AI become an integral part of every application, we believe Cloudflare is well suited for powering those by making it easy and affordable for developers to get started.”

All of the announced features are available immediately. Pricing will be, essentially, usage-based, with different schemes for Workers AI, Vectorize, and AI Gateway. (Vectorize, Prince noted, will be free to use until 2024.)