AI infrastructure
Enterprises investing in deep learning platforms need AI infrastructure sufficient to synthesize massive amounts of data. Find the information you need to make decisions about AI-specific compute architectures -- from GPU-packed servers to highly scalable clustered computing systems built for big data and machine learning applications.
Top Stories
-
News
11 Apr 2024
Meta's new silicon shows a growing trend for AI hyperscalers
The new hardware highlights a growing trend of hyperscalers designing custom chips for internal use. This move will help vendors rely less on hardware providers such as Nvidia. Continue Reading
-
Tip
11 Apr 2024
Building networks for AI workloads
Conventional and high-performance computing networks cannot adequately support AI workloads, so network engineers must build specialized networks to accommodate their massive size. Continue Reading
-
Definition
10 Apr 2024
schema
In computer programming, a schema (pronounced SKEE-mah) is the organization or structure for a database, while in artificial intelligence (AI), a schema is a formal expression of an inference rule. Continue Reading
-
Tip
04 Apr 2024
How to build an enterprise generative AI tech stack
Generative AI tech stacks consist of key components like LLMs, vector databases and fine-tuning tools. The right tech stack can help enterprises maximize their generative AI ROI. Continue Reading
-
News
02 Apr 2024
Edge AI startup reveals GenAI accelerator, $120M fundraise
The startup introduced a new GenAI accelerator for PCs and smart vehicles. The vendor's growth highlights the shift of GenAI workloads from the cloud to the edge. Continue Reading
-
News
28 Mar 2024
US AI policy for federal agencies requires transparency
The OMB's new policy calls for federal agencies to be transparent about AI use and designate chief AI officers to coordinate efforts. Continue Reading
-
Feature
27 Mar 2024
AI hardware vendors band together to challenge Nvidia
An industry group including Arm and Intel seeks to increase the number of options in the AI market and decrease developers' dependence on GPUs. Continue Reading
-
Podcast
25 Mar 2024
Security, bias risks inherent in GenAI black box models
Language models are stochastic models that generate output based on data upon which they have been trained. Often, these models are a closed black box. That leads to many problems. Continue Reading
-
News
22 Mar 2024
Nvidia partners, customers drive AI into data centers
Nvidia and its partners are providing the tools and infrastructure to build and deploy AI applications that companies say could transform their businesses. Continue Reading
-
News
21 Mar 2024
UN AI resolution marks global interest in rules, principles
While the United Nations' artificial intelligence resolution does not create legally binding rules, it might indicate which countries are thinking in that direction. Continue Reading
-
News
18 Mar 2024
Nvidia unveils new AI Blackwell chip, microservices and more
The vendor launched a barrage of AI tech including faster chips in its new Blackwell infrastructure and new microservices that enable enterprises to create custom applications. Continue Reading
-
Tip
18 Mar 2024
Compare enterprise generative AI deployment options
To pick the best generative AI deployment model for your organization, examine how cloud and on-premises approaches fit into your security, cost, infrastructure and network needs. Continue Reading
-
News
13 Mar 2024
Cerebras introduces next-gen AI chip for GenAI training
The new accelerator is for training large AI models. It powers the startup's CS-3 supercomputer, which is designed to train models that are 10 times larger than GPT-4 and Gemini. Continue Reading
-
Feature
13 Mar 2024
The need for common sense in AI systems
Building explainable and trustworthy AI systems is paramount. To get there, computer scientists Ron Brachman and Hector Levesque suggest infusing common sense into AI development. Continue Reading
-
News
12 Mar 2024
Meta intros two GPU training clusters for Llama 3
The Facebook parent company said the training clusters are part of its plans to grow its infrastructure and obtain 350,000 Nvidia H100 GPUs by the end of the year. Continue Reading
-
News
12 Mar 2024
Cohere tackles some generative AI challenges with Command-R
The startup's new large language model aims to address problems with factual accuracy in generative AI models. It also focuses on language problems, as well as cloud and API challenges. Continue Reading
-
News
11 Mar 2024
Elon Musk plans to take xAI chatbot Grok open source
The move comes nearly two weeks after the Tesla owner filed a lawsuit against OpenAI. It also comes as more vendors are providing open source options for enterprise users. Continue Reading
-
News
11 Mar 2024
Salesforce AI a work in progress for customers
While eager to embed AI in Salesforce software, customers acknowledge that planning, time and work are needed to make AI useful in the company's applications. Continue Reading
-
News
11 Mar 2024
Podcast: A look at SambaNova's open source AI strategy
Despite sometimes being seen as a direct competitor to Nvidia, the AI hardware and software vendor tries to distinguish itself by focusing on training open source models. Continue Reading
-
News
07 Mar 2024
Microsoft whistleblower, OpenAI, the NYT, and ethical AI
The vendor has filed a memorandum to dismiss some of the arguments The New York Times made in its copyright lawsuit. However, it now faces criticism from its own software engineer. Continue Reading
-
News
05 Mar 2024
Box AI adds Microsoft Azure OpenAI Service integration
Box adds Microsoft Azure OpenAI Service to its lineup of AI tools for document summaries, joining Google's Vertex and OpenAI LLMs for users to choose from. Continue Reading
-
News
04 Mar 2024
AI race surges as Anthropic intros Claude 3
The new models have a larger context window and multimodal capabilities. They reflect the new level of normal in generative AI and the myriad model choices for enterprises. Continue Reading
-
News
29 Feb 2024
H2O.ai releases small language model: H2O-Danube-1.8B
The new model comes as the generative AI market continues to see the emergence of small language models. The models provide enterprises with better privacy and data controls. Continue Reading
-
News
29 Feb 2024
Collibra adds AI governance to data management platform
The data management vendor's new suite adds capabilities aimed at enabling enterprises to use AI safely and securely, in the same way data governance frameworks apply to data. Continue Reading
-
News
28 Feb 2024
Intel, Nvidia aim latest systems-on-a-chip at AI workstations
Nvidia's RTX 500 and RTX 1000 GPUs are for the lightest mobile workstations. Intel's Core Ultra with a built-in GPU can handle many AI-powered tasks on the workhorse computers. Continue Reading
-
Tip
27 Feb 2024
Gemini vs. ChatGPT: What's the difference?
ChatGPT took an early lead among generative AI chatbots before Google answered with Gemini, formerly Bard. While ChatGPT and Gemini perform similar tasks, there are differences. Continue Reading
-
News
26 Feb 2024
Microsoft allies with OpenAI rival Mistral AI
The tech giant is investing in the open source startup. The partnership means Mistral's premium models, including its new model, Mistral Large, will be available on Azure. Continue Reading
-
News
22 Feb 2024
Stability AI adopts new architecture in Stable Diffusion 3
The new version of the image model uses a different architecture than previous versions. It comes in different sizes and has better spelling capabilities. Continue Reading
-
News
22 Feb 2024
Intel Foundry launches as enterprise AI surges
If the trend continues, enterprises will need more AI chip suppliers to help stabilize prices and meet the demand for AI processing at the edge and the data center, experts said. Continue Reading
-
News
22 Feb 2024
AI vendor finds opportunity amid AI computing problem
The GPU cloud provider recently raised $320 million. It has found an opportunity as more enterprises seek to run generative models and the demand for infrastructure is high. Continue Reading
-
News
21 Feb 2024
Google releases new family of open models: Gemma
The cloud provider's new models compete with Meta's Llama 2 open source model. Google incorporates responsible AI standards that should appeal to enterprises. Continue Reading
-
Opinion
21 Feb 2024
AI news roundup: OpenAI video model, Nvidia chatbot and more
Explore last week's AI news highlights with analyst Mike Leone's roundup of top developments, including OpenAI's launch of video model Sora and Nvidia's locally running chatbot. Continue Reading
-
News
15 Feb 2024
Declining revenues lead to 4,000 job cuts at Cisco
A 12% drop in networking revenue contributed to the company's overall revenue decline and its decision to cut 5% of its workforce. Continue Reading
-
News
15 Feb 2024
Google updates AI model Gemini, adds 1M context window
The cloud provider's 1.5 Pro model has the largest context window seen in the market. Despite its innovation, it still needs to show the applicability of its model for enterprises. Continue Reading
-
News
15 Feb 2024
Startup intros new platform for AI inferencing at the edge
The platform delivers 5G and GPU-based micro clouds to specific locations. It's for enterprises that want an on-premises deployment without having to own the physical data center. Continue Reading
-
News
08 Feb 2024
Google turns Bard AI into Gemini, launches Gemini Advanced
The tech giant gave Bard a new name and introduced users to Ultra 1.0 through a new mobile app. It revealed changes to Duet AI and Cloud. Continue Reading
-
News
07 Feb 2024
Latest Cisco products show measured approach to AI
Cisco has launched a SaaS product for applying policy controls to AI model-bound data and an Nvidia partnership to bolster Cisco UCS servers for AI at the edge and data centers. Continue Reading
-
Definition
06 Feb 2024
explainable AI
Explainable AI (XAI) is artificial intelligence (AI) that's programmed to describe its purpose, rationale and decision-making process in a way that the average person can understand. Continue Reading
-
Feature
02 Feb 2024
AI, the 2024 U.S. election and the spread of disinformation
Generative technology-fueled deepfakes could interfere with the November election due to ease of use and power of the technology. The outlook for regulation seems dim. Continue Reading
-
Tip
31 Jan 2024
8 top generative AI tool categories for 2024
Need a generative AI-specific tool for your organization's development project? Explore the major categories these tools fall into and their capabilities. Continue Reading
-
Definition
30 Jan 2024
Retrieval-Augmented Language Model pre-training
A Retrieval-Augmented Language Model, also referred to as REALM or RALM, is an artificial intelligence language model designed to retrieve text and then use it to perform question-based tasks. Continue Reading
-
Tip
30 Jan 2024
AI model optimization: How to do it and why it matters
Challenges like model drift and operational inefficiency can plague AI models. These model optimization strategies can help engineers improve performance and mitigate issues. Continue Reading
-
News
29 Jan 2024
Juniper adds data center networks to Mist AI
Juniper's Mist AI update will include using the Marvis virtual network assistant to obtain information on data center cabling, configurations and connectivity issues. Continue Reading
-
News
25 Jan 2024
Google and Hugging Face unveil AI partnership
The partnership reflects the cloud provider's support of open source and aims to appeal to enterprises looking to move from ideation to implementation of generative AI workloads. Continue Reading
-
Feature
25 Jan 2024
A guide to artificial intelligence in the enterprise
AI in the enterprise is changing how work is done, but companies must overcome various challenges to derive value from this powerful and rapidly evolving technology. Continue Reading
-
News
24 Jan 2024
Equinix data centers offer Nvidia AI infrastructure
Equinix data centers will offer Nvidia DGX as a managed service. The platform is also available on public cloud providers, AWS, Google Cloud and Microsoft Azure. Continue Reading
-
News
23 Jan 2024
Oracle boosts generative AI service and intros new services
The tech giant's services are similar to what other big tech vendors have introduced. However, Oracle's integration of GenAI technology into its SaaS applications makes it notable. Continue Reading
-
Tip
22 Jan 2024
Artificial intelligence vs. human intelligence: Differences explained
Artificial intelligence is humanlike. There are differences, however, between natural and artificial intelligence. Here are three ways AI and human cognition diverge. Continue Reading
-
Tip
18 Jan 2024
Why object storage for AI makes sense
Object storage and AI are two hot trends in tech. They exist on their own but work well together too. Still, organizations must also consider the challenges of using both. Continue Reading
-
Definition
18 Jan 2024
What is generative AI? Everything you need to know
Generative AI is a type of artificial intelligence technology that can produce various types of content, including text, imagery, audio and synthetic data. Continue Reading
-
Definition
16 Jan 2024
artificial intelligence (AI) governance
Artificial intelligence governance is the legal framework for ensuring AI and machine learning technologies are researched and developed with the goal of helping humanity adopt and use these systems in ethical and responsible ways. Continue Reading
-
News
16 Jan 2024
Podcast: Examining Microsoft VC M12's AI investment policy
M12 looks for new and transformative technologies. It also seeks to back companies that are focused on how the generative AI market might shift in the coming years. Continue Reading
-
News
09 Jan 2024
HPE to acquire Juniper Networks for $14 billion
HPE plans to use Juniper's hardware and software to make networking a 'core business and architecture foundation' for HPE's GreenLake hybrid cloud and AI platform. Continue Reading
-
Tip
05 Jan 2024
Learn how to create a machine learning pipeline
Well-considered machine learning pipelines provide a structured approach to AI development in modern IT environments, ensuring uniformity, speed and business alignment. Continue Reading
-
Feature
04 Jan 2024
10 top AI and machine learning trends for 2024
Custom enterprise models, open source AI, multimodal -- learn about the top AI and machine learning trends for 2024 and how they promise to transform the industry. Continue Reading
-
News
03 Jan 2024
The importance of the 'New York Times' AI copyright lawsuit
The newspaper publisher is the first major news outlet to sue the AI creator. While the suit might not reach court, it still has a significant impact on the AI community. Continue Reading
-
Podcast
02 Jan 2024
A challenge: Guiding generative AI toward responsible use
Transparency, explainability and lack of bias are principles for building generative AI systems that work according to ethical rules and are fair for everyone. Continue Reading
-
Feature
28 Dec 2023
Compare GPUs vs. CPUs for AI workloads
GPUs are often presented as the vehicle of choice to run AI workloads, but the push is on to expand the number and types of algorithms that can run efficiently on CPUs. Continue Reading
-
Definition
27 Dec 2023
hyperautomation
Hyperautomation is a framework and a set of advanced technologies for scaling automation in the enterprise. The ultimate goal of hyperautomation is to develop a process for automating enterprise automation. Continue Reading
-
Podcast
18 Dec 2023
2024 will see generative AI mature
Next year, enterprises will likely be inundated with more capabilities and applications for GenAI. The year will lead to smaller models and regulation in more countries. Continue Reading
-
News
15 Dec 2023
Intel Core Ultra CPUs with neural processing run AI on PCs
Intel's Core Ultra CPUs now contain embedded AI neural processing, which adds options for device manufacturers to divide demand for AI processes among different hardware resources. Continue Reading
-
News
15 Dec 2023
Small language models an emerging GenAI force
Enterprises are unwilling to pay for large language models to accomplish simple business tasks with generative AI. They're looking at cheaper small language models. Continue Reading
-
News
13 Dec 2023
Google starts incorporating Gemini across AI software stack
The cloud provider revealed the model's API is now available in Studio and on its Vertex Platform. It also introduced new Duet offerings and a partnership with Mistral. Continue Reading
-
Feature
11 Dec 2023
Big money investments, not acquisitions, fuel GenAI startups
With the generative AI explosion comes a new trend for the tech giants. Instead of buying smaller companies, big cloud vendors are partnering with the startups. Continue Reading
-
Feature
08 Dec 2023
Generative AI as a copilot for finance and other sectors
While many fear that the popularity of large language models could lead to job loss and replacement, some industries such as finance and education are using AI to augment workers. Continue Reading
-
News
07 Dec 2023
Federal procurement of AI tools hurt by White House rules
Obstacles facing Biden's executive order on AI and draft OMB guidance include lack of clarity around AI procurement and inability of Congress to take action on AI policy. Continue Reading
-
News
07 Dec 2023
How generative AI is changing the fashion industry
The industry is using AI technology in design and skin care. Some companies are going beyond generative tools, using computer vision and augmented reality. Continue Reading
-
News
06 Dec 2023
AMD Instinct MI300 AI accelerator takes aim at Nvidia GPUs
Data center-grade GPUs and accelerators for enterprise customers and cloud vendors are the new battleground for AI hardware. AMD and Google advance the race with new chips. Continue Reading
-
News
05 Dec 2023
IBM, Meta form AI Alliance to promote open AI
Meta and IBM launched the new group with more than 50 other organizations to foster an open community that helps accelerate the development of responsible AI systems. Continue Reading
-
News
04 Dec 2023
AWS customers grapple with generative AI shortfalls
Enterprises fear failing to accommodate technical immaturity, ensure data security and governance, prevent biases built into models, and correct the erroneous responses of LLMs. Continue Reading
-
News
28 Nov 2023
AWS unveils new AI chatbot, chips, Nvidia partnership
The cloud giant's new chatbot is for enterprises looking for more productivity. It is infused into AWS applications for contact centers and business intelligence. Continue Reading
-
Definition
28 Nov 2023
ternary content-addressable memory (TCAM)
Ternary content-addressable memory (TCAM) is a specialized type of high-speed memory that searches its entire contents in a single clock cycle. Continue Reading
-
News
22 Nov 2023
OpenAI reinstates Sam Altman as CEO, but problems remain
The AI startup reinstated the former CEO after firing him on Nov. 17. The news seems to end a whirlwind of events that highlighted fundamental problems at the vendor. Continue Reading
-
Tip
21 Nov 2023
Top 8 AI hardware companies
Due to rapid AI hardware advancement, companies are releasing advanced products yearly to keep up with the competition. The new competitive product on the market is the AI chip. Continue Reading
-
News
20 Nov 2023
Fallout after Microsoft hires former OpenAI CEO Sam Altman
The tech giant snagged a major win by hiring Altman and former board member Greg Brockman. Meanwhile, the AI startup faces challenges with many employees threatening to quit. Continue Reading
-
Definition
16 Nov 2023
robo-advisor
A robo-advisor is a virtual financial advisor powered by artificial intelligence (AI) that employs an algorithm to deliver an automated selection of financial advisory services. Continue Reading
-
News
15 Nov 2023
Microsoft launches custom AI, server chips for Azure
Maia is for AI inferencing, while the Arm-based Cobalt is for general-purpose computing. The chips could compete with AMD, Nvidia and Intel offerings eventually. Continue Reading
-
News
13 Nov 2023
Nvidia intros new H200 for running large AI workloads
The chipmaker's new GPU has the latest high-bandwidth memory capacity to accelerate generative models and large language models. The chip focuses on optimizing CPUs and GPUs. Continue Reading
-
Definition
13 Nov 2023
vector search
Vector search, sometimes referred to as vector similarity search, is a technique that uses vectors -- numerical representations of data -- as the basis to conduct searches and identify relevance. Continue Reading
-
News
08 Nov 2023
OpenAI's new versions of GPT target more enterprises
The vendor targets enterprises with new tools and products tailored to what businesses and developers need to incorporate generative AI, including better pricing. Continue Reading
-
News
03 Nov 2023
AI executive order does not fix need for federal regulation
The executive order on AI focuses on data privacy, defense and many other areas. While it's seen as comprehensive, many are looking for the U.S. to enact federal legislation. Continue Reading
-
Tip
01 Nov 2023
Tips for planning a machine learning architecture
When planning a machine learning architecture, organizations must consider factors such as performance, cost and scalability. Review necessary components and best practices. Continue Reading
-
News
30 Oct 2023
Data center power constraints send AI everywhere
Most experts agree there isn't enough unused electricity to perform future AI processing in hyperscale data centers and colocation facilities as GenAI demand soars. Continue Reading
-
News
25 Oct 2023
Apple's silence on generative AI is characteristic
The consumer tech company is behind other tech giants in the generative AI race. However, stealth is the company's normal mode of operation with disruptive technologies. Continue Reading
-
Feature
20 Oct 2023
Generative AI's sustainability problems explained
Generative AI tools and LLMs such as ChatGPT have exploded onto the tech scene. Here's a look at what that costs the environment and how to decrease the negative impact. Continue Reading
-
Definition
18 Oct 2023
neural net processor
A neural net processor is a central processing unit (CPU) that holds the modeled workings of how a human brain operates on a single chip. Continue Reading
-
Definition
18 Oct 2023
prompt engineering
Prompt engineering is an AI engineering technique encompassing the process of refining LLMs with specific prompts and recommended outputs, as well as the process of refining input to various generative AI services to generate text or images. Continue Reading
-
Tip
13 Oct 2023
How to source AI infrastructure components
Rent, buy or repurpose AI infrastructure? The right choice depends on an organization's planned AI projects, budget, data privacy needs and technical personnel resources. Continue Reading
-
News
11 Oct 2023
AMD tries to differentiate in AI market with acquisition
The acquisition of the open source startup helps the chip vendor compete in the optimization market and offer flexibility compared to competitors such as Nvidia. Continue Reading
-
Definition
10 Oct 2023
neurosynaptic chip
A neurosynaptic chip, also known as a cognitive chip, is a computer processor that is designed to function more like a biological brain than a typical central processing unit (CPU). Continue Reading
-
Definition
05 Oct 2023
IBM Watson supercomputer
Watson was a supercomputer designed and developed by IBM. This advanced computer combined artificial intelligence (AI), automation and sophisticated analytics capabilities to deliver optimal performance as a 'question answering' machine. Continue Reading
-
News
25 Sep 2023
Amazon's $4B investment in Anthropic fuels GenAI race
This makes the tech giant Anthropic's primary cloud provider. This arrangement is different from Google's investment in the AI startup and Microsoft's investment in OpenAI. Continue Reading
-
News
21 Sep 2023
Oracle generative AI features differ from Microsoft offering
Oracle's generative AI service, available in beta, uses partner Cohere's second large language model to analyze text for the feelings and opinions behind it, according to Gartner. Continue Reading
-
Feature
21 Sep 2023
The future of generative AI: How will it impact the enterprise?
Learn how generative AI will affect organizations in terms of capabilities, enterprise workflows and ethics, and how the technology will shape enterprise use cases. Continue Reading
-
News
19 Sep 2023
SambaNova AI launches new chip: the SN40L
The AI hardware and software provider's new chip offers enterprises a full-stack approach to training LLMs. It also makes it possible for customers to train multimodal models. Continue Reading
-
News
18 Sep 2023
New Anyscale service enables fine-tuning of open source LLMs
The AI startup introduced a service that lets enterprises deploy large language models into their applications using popular LLM APIs like Llama 2. Continue Reading
-
News
14 Sep 2023
A venture capitalist's take on generative AI investment
Funding for startups such as Anthropic, Cohere and Hugging Face shows that money is still flowing into the market. However, the criteria for funding are still strict. Continue Reading
-
Opinion
13 Sep 2023
Why IT leaders should deploy generative AI infrastructure now
Organizations should seek to incorporate a generative AI product that is easy to use, flexible and on site. Nutanix GPT-in-a-Box is among the candidates with these options. Continue Reading
-
News
12 Sep 2023
How different industries apply generative AI
Large language models such as ChatGPT, Bard and Llama have changed how enterprises think about technology. While some can use new AI tools now, others face challenges. Continue Reading
-
Podcast
11 Sep 2023
The readiness of AI and LLM technology
Generative AI technology has already disrupted how enterprises work. While many companies are uneasy with the fast-evolving technology, that's expected to change. Continue Reading
-
Podcast
08 Sep 2023
Transforming healthcare with artificial intelligence
Discover how applying AI to healthcare can optimize the patient experience, drive efficiencies and reduce the burden of cardiovascular diseases on the global healthcare system. Continue Reading
-
News
06 Sep 2023
AI inference startup raises $110 million
Investors are betting on d-Matrix to be a long-lasting vendor in the AI hardware and compute market, despite Nvidia's leading status. The startup offers a low-cost option. Continue Reading