Commit bae54c3

Update AdaGard.md for equation rendering
1 parent 331aaaa commit bae54c3

1 file changed: +9 −10 lines

docs/Deep Learning/Optimizers in Deep Learning/AdaGard.md

Lines changed: 9 additions & 10 deletions
@@ -30,21 +30,20 @@ The update rule for AdaGrad is as follows:
 
 1. Accumulate the squared gradients:
 
-$
+$$
 G_t = G_{t-1} + g_t^2
-$
+$$
 
 2. Update the parameters:
 
-$
-\theta_t = \theta_{t-1} - \frac{\eta}{\sqrt{G_t} + \epsilon} \cdot g_t
-$
+
+$$\theta_t = \theta_{t-1} - \frac{\eta}{\sqrt{G_t} + \epsilon} \cdot g_t$$
 
 where:
-- $ G_t $ is the accumulated sum of squares of gradients up to time step $ t $
-- $ g_t $ is the gradient at time step $ t $
-- $ \eta $ is the learning rate
-- $ \epsilon $ is a small constant to prevent division by zero
+- $G_t$ is the accumulated sum of squares of gradients up to time step $t$
+- $g_t$ is the gradient at time step $t$
+- $\eta$ is the learning rate
+- $\epsilon$ is a small constant to prevent division by zero
 
 
 ## Implementation in Keras
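For reference, a minimal NumPy sketch of the two update steps above; the function name, learning rate, and toy objective are illustrative placeholders, not code from AdaGard.md:

```python
import numpy as np

def adagrad_step(theta, g, G, lr=0.01, eps=1e-8):
    """One AdaGrad update: accumulate squared gradients, then take a scaled step."""
    G = G + g ** 2                                # G_t = G_{t-1} + g_t^2
    theta = theta - lr / (np.sqrt(G) + eps) * g   # theta_t = theta_{t-1} - eta / (sqrt(G_t) + eps) * g_t
    return theta, G

# Toy usage: minimize f(theta) = theta^2, whose gradient is 2 * theta.
theta, G = np.array([1.0]), np.zeros(1)
for _ in range(100):
    theta, G = adagrad_step(theta, 2 * theta, G, lr=0.1)
print(theta)  # approaches 0; the growing accumulator G_t shrinks the effective step size
```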

@@ -107,4 +106,4 @@ The results of the training process, including the loss and accuracy, will be di
 
 ## What Next
 
-To address these issues, various optimization algorithms have been developed, such as Adam, which incorporate techniques. Which we'll see in next section .
+To address these issues, various optimization algorithms have been developed, such as Adam, which incorporate additional techniques; we'll see these in the next section.
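The Keras section of the file is not visible in this diff, so here is only a hedged sketch of how AdaGrad is typically selected in Keras; the model architecture, input shape, loss, and hyperparameters below are illustrative assumptions, not the file's actual code:

```python
import tensorflow as tf

# Illustrative model only; the real model in AdaGard.md is not shown in this diff.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(20,)),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])

# eta maps to learning_rate and epsilon to epsilon in the update rule above.
model.compile(
    optimizer=tf.keras.optimizers.Adagrad(learning_rate=0.01, epsilon=1e-7),
    loss="binary_crossentropy",
    metrics=["accuracy"],
)

# model.fit(x_train, y_train, epochs=10) would then report the loss and accuracy
# mentioned in the second hunk's context line.
```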
