Shakti Wadekar – Medium

Shakti Wadekar

Pinned

Published in
Towards AI

DeepSeek-R1: Model Architecture

This article provides an in-depth exploration of the DeepSeek-R1 model architecture. Let’s trace DeepSeek-R1 model from input to the output…

Feb 5

DeepSeek-R1: Model Architecture

Feb 5

Understanding Scalable Deployment Tools on AWS: AWS ECR, ECS, ALB, IAM and Secrets Manager

Deploying a Large Language Model (LLM) chat application that can scale efficiently on AWS requires understanding key AWS services. This…

Feb 17

Understanding Scalable Deployment Tools on AWS: AWS ECR, ECS, ALB, IAM and Secrets Manager

Feb 17

Learning to Build Scalable LLM Chat Application: Microservices Architecture and Docker…

📜 Table of Contents

Feb 14

Learning to Build Scalable LLM Chat Application: Microservices Architecture and Docker…

Feb 14

Published in
Towards AI

GRPO and DeepSeek-R1-Zero

📚 Table of Contents

Feb 7

GRPO and DeepSeek-R1-Zero

Feb 7

Published in
The Startup

DeepSeek-R1: Training Recipe and Data

For simplified understanding, the training pipeline of DeepSeek-R1 is presented in 6 stages. The official technical report describes it in…

Feb 5

DeepSeek-R1: Training Recipe and Data

Feb 5

Published in
The Startup

Evaluate Robustness of Convolutional Neural Networks (CNNs) with CIFAR100-C and CIFAR10-C datasets

CIFAR100-C and CIFAR10-C datasets explained and github code provided

Jan 12, 2022

Evaluate Robustness of Convolutional Neural Networks (CNNs) with CIFAR100-C and CIFAR10-C datasets

Jan 12, 2022

Published in
Geek Culture

Visualizing Hyperparameter Tuning Results of KerasTuner With Weights & Biases

My previous blog explains about how to use KerasTuner for hyperparameter tuning in Keras/TensorFlow 2. This article shows how to visualize…

Mar 8, 2021

Visualizing Hyperparameter Tuning Results of KerasTuner With Weights & Biases

Mar 8, 2021

Published in
The Startup

Optuna: Hyperparameter Optimization in PyTorch

Hyperparameter tuning of PyTorch models with Optuna

Jan 19, 2021

Optuna: Hyperparameter Optimization in PyTorch

Jan 19, 2021

Published in
Analytics Vidhya

Solution to TensorFlow 2 not using GPU

Making TensorFlow 2 code or Keras code run on GPU

Jan 16, 2021

Solution to TensorFlow 2 not using GPU

Jan 16, 2021

Published in
The Startup

Hyperparameter Tuning in Keras: TensorFlow 2: With Keras Tuner: RandomSearch, Hyperband…

This article will explore the options available in Keras Tuner for hyperparameter optimization with example TensorFlow 2 codes for…

Jan 10, 2021

Hyperparameter Tuning in Keras: TensorFlow 2: With Keras Tuner: RandomSearch, Hyperband…

Jan 10, 2021

Why Softmax not used when Cross-entropy-loss is used as loss function during Neural Network…

\large{Quick answer:}

Jan 3, 2021

Why Softmax not used when Cross-entropy-loss is used as loss function during Neural Network…

Jan 3, 2021

How to avoid numerical overflow in Sigmoid function: Numerically stable sigmoid function

Quick answer:

Jul 30, 2020

How to avoid numerical overflow in Sigmoid function: Numerically stable sigmoid function

Jul 30, 2020

Shakti Wadekar

Shakti Wadekar

Machine Learning Explorer

Following

Help
Status
About
Careers
Press
Blog
Privacy
Rules
Terms
Text to speech