Skip to content

Commit 5420581

Browse files
authored
Merge pull request #37 from imohitmayank/deepseek
NLP>DeepSeek page added
2 parents e99e3fb + 7694e9a commit 5420581

File tree

8 files changed

+267
-2
lines changed

8 files changed

+267
-2
lines changed

docs/imgs/nlp_deepseek_archi.png

243 KB
Loading
169 KB
Loading

docs/imgs/nlp_deepseek_eval.png

292 KB
Loading

docs/imgs/nlp_deepseek_intro.png

414 KB
Loading

docs/imgs/nlp_deepseek_r1zero.png

165 KB
Loading

docs/machine_learning/interview_questions.md

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -508,5 +508,12 @@
508508

509509
This model is commonly used to capture human preferences to provide a framework for understanding and incorporating human feedback into the training of language models.
510510

511+
!!! Question ""
512+
=== "Question"
513+
#### What is Rejection Sampling in Machine Learning?
514+
515+
=== "Answer"
516+
Rejection sampling is a method to generate samples from a complex target distribution (like a hard-to-sample probability curve) by using a simpler "proposal" distribution you can easily sample from (e.g., a uniform or normal distribution).
517+
518+
Here's how it works: you first pick a proposal distribution that covers the target’s range. Then, you repeatedly draw samples from this simpler distribution and "accept" or "reject" each sample based on a quality check—if a random number (from 0 to 1) is less than the ratio of the target’s density to the proposal’s density (scaled by a constant), you keep the sample; otherwise, you discard it. This process ensures the accepted samples match the target distribution. It’s like filtering out bad candidates until you’re left with samples that fit your desired pattern. While simple to implement, it becomes inefficient for high-dimensional data or if the proposal distribution doesn’t closely match the target shape.
511519

512-

docs/natural_language_processing/deepseek.md

Lines changed: 258 additions & 0 deletions
Large diffs are not rendered by default.

mkdocs.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -80,9 +80,9 @@ nav:
8080
- 'MiniLM': 'natural_language_processing/minilm.md'
8181
- 'natural_language_processing/T5.md'
8282
- 'natural_language_processing/FlanModels.md'
83-
# - 'ChatGPT': 'natural_language_processing/chatgpt.md'
8483
- 'LLaMA': 'natural_language_processing/llama.md'
8584
- 'Mamba': 'natural_language_processing/mamba.md'
85+
- 'DeepSeek R1': 'natural_language_processing/deepseek.md'
8686
- 'Large Language Models':
8787
- 'Training LLMs': 'natural_language_processing/training_llm.md'
8888
- 'Prompt Engineering': 'natural_language_processing/prompt_engineering.md'

0 commit comments

Comments
 (0)