Proximal Policy Optimization Explained

May 12, 2026
AI And Machine Learning
Comment off

Learn how Proximal Policy Optimization improves reinforcement learning stability and performance. Explore its theory, key concepts, and implementation.

QwenLong-L1.5: Long-Context Reasoning

May 12, 2026
AI And Machine Learning
Comment off

‘Learn how QwenLong-L1.5 enables long-context reasoning with advanced memory management and reinforcement learning, and how to run it on GPUs.’

Evaluating Reward Models with RewardBench 2

May 12, 2026
AI And Machine Learning
Comment off

‘ RewardBench 2 seeks to evaluate reward models. In this article, we describe its relevance, conception, and how to get started with using it.’

Serverless Inference with the the cloud provider Gradient Platform

May 12, 2026
AI And Machine Learning
Comment off

In this tutorial, we show how to access and use the new Serverless Inference feature from the cloud provider’s Gradient Platform.

The Swish Activation Function

May 12, 2026
AI And Machine Learning
Comment off

This blogpost is an in-depth discussion of the Google Brain paper titled “Searching for activation functions” which has since revived research into activation functions.

Step-by-step instructions for training YOLOv7 on a Custom Dataset

May 12, 2026
AI And Machine Learning
Comment off

Follow these step-by-step instructions to learn how to train YOLOv7 on custom datasets, and then test it with our sample demo on detecting objects with the Road Sign Detection dataset with Gradient’s Free GPU Notebooks

Unlock the Power of AI/ML and Managed OpenSearch

May 12, 2026
AI And Machine Learning
Comment off

As AI/ML continues to dominate the tech landscape, new tools emerge to streamline development and improve efficiency. This session explores how OpenSearch projects can benefit from AI/ML, delivering smarter, faster solutions.

Weather forecast using LSTM networks

May 12, 2026
AI And Machine Learning
Comment off

In this post, we presented the LSTM subclass and used it to construct a weather forecasting model. We proved its effectiveness as a subgroup of RNNs designed to detect patterns in data sequences, including numerical time series data.