fix: title and missing bibtex

xukai92 · xukai92 · commit 076f4115bde7 · 2025-02-17T21:22:47.000-05:00
Signed-off-by: Kai Xu &lt;me@xuk.ai&gt;
diff --git a/_posts/2025-02-06-r1-like-reasoning-update-1.md b/_posts/2025-02-06-r1-like-reasoning-update-1.md
@@ -1,5 +1,5 @@
 ---
-title: Lessons on Reproducing R1-like Reasoning in Small LLMs without using DeepSeek-R1-Zero (or its derivatives) - Update 1
+title: Update 1 - Lessons on Reproducing R1-like Reasoning in Small LLMs without using DeepSeek-R1-Zero (or its derivatives)
 date: 2025-02-06
 ---
 
diff --git a/_posts/2025-02-07-r1-like-reasoning-update-2.md b/_posts/2025-02-07-r1-like-reasoning-update-2.md
@@ -1,5 +1,5 @@
 ---
-title: Lessons on Reproducing R1-like Reasoning in Small LLMs without using DeepSeek-R1-Zero (or its derivatives) - Update 2
+title: Update 2 - Lessons on Reproducing R1-like Reasoning in Small LLMs without using DeepSeek-R1-Zero (or its derivatives)
 date: 2025-02-07
 ---
 
diff --git a/_posts/2025-02-17-r1-like-reasoning-update-3.md b/_posts/2025-02-17-r1-like-reasoning-update-3.md
@@ -40,3 +40,16 @@ Inference-time scaling can be computationally expensive, requiring both the **tr
 In other words, **can we train the model to not only generate responses but also judge and refine its own drafts?** This could effectively **amortize** the cost of inference-time scaling by training the model to perform this reasoning process upfront.  
 
 Moving forward, we’ll be focusing our efforts on testing this hypothesis and will continue sharing our findings. Stay tuned!  
+
+---
+
+If you want to cite our work, you can use the following BibTeX entry of the original blog post.
+
+```bibtex
+@misc{srivastava2024lessonsonreproducing,  
+      title={Lessons on Reproducing R1-like Reasoning in Small LLMs without using DeepSeek-R1-Zero (or its derivatives)},  
+      author={Akash Srivastava, Isha Puri, Kai Xu, Shivchander Sudalairaj, Mustafa Eyceoz, Oleg Silkin, Abhishek Bhandwaldar, Aldo Genaro Pareja Cardona and GX Xu},  
+      url={https://red-hat-ai-innovation-team.github.io/posts/r1-like-reasoning},  
+      year={2025},  
+}  
+```