forked from ggml-org/llama.cpp
CANN: fix llama-parallel precision bug #16
Open
zsq0216 wants to merge 1 commit into noemotiovon:master from zsq0216:master
Author
# ./build/bin/llama-parallel -m Qwen3-0.6B-Q4_0.gguf -np 8 -ns 128 --top-k 1 --junk 10 -c 16384 -ngl 99
build: 6277 (74f52f77) with cc (Ubuntu 11.4.0-1ubuntu1~22.04) 11.4.0 for aarch64-linux-gnu
llama_model_load_from_file_impl: using device CANN0 (Ascend910B2) - 62071 MiB free
llama_model_loader: loaded meta data with 32 key-value pairs and 310 tensors from Qwen3-0.6B-Q4_0.gguf (version GGUF V3 (latest))
llama_model_loader: Dumping metadata keys/values. Note: KV overrides do not apply in this output.
llama_model_loader: - kv 0: general.architecture str = qwen3
llama_model_loader: - kv 1: general.type str = model
llama_model_loader: - kv 2: general.name str = Qwen3-0.6B
llama_model_loader: - kv 3: general.basename str = Qwen3-0.6B
llama_model_loader: - kv 4: general.quantized_by str = Unsloth
llama_model_loader: - kv 5: general.size_label str = 0.6B
llama_model_loader: - kv 6: general.repo_url str = https://huggingface.co/unsloth
llama_model_loader: - kv 7: qwen3.block_count u32 = 28
llama_model_loader: - kv 8: qwen3.context_length u32 = 40960
llama_model_loader: - kv 9: qwen3.embedding_length u32 = 1024
llama_model_loader: - kv 10: qwen3.feed_forward_length u32 = 3072
llama_model_loader: - kv 11: qwen3.attention.head_count u32 = 16
llama_model_loader: - kv 12: qwen3.attention.head_count_kv u32 = 8
llama_model_loader: - kv 13: qwen3.rope.freq_base f32 = 1000000.000000
llama_model_loader: - kv 14: qwen3.attention.layer_norm_rms_epsilon f32 = 0.000001
llama_model_loader: - kv 15: qwen3.attention.key_length u32 = 128
llama_model_loader: - kv 16: qwen3.attention.value_length u32 = 128
llama_model_loader: - kv 17: tokenizer.ggml.model str = gpt2
llama_model_loader: - kv 18: tokenizer.ggml.pre str = qwen2
llama_model_loader: - kv 19: tokenizer.ggml.tokens arr[str,151936] = ["!", "\"", "#", "$", "%", "&", "'", ...
llama_model_loader: - kv 20: tokenizer.ggml.token_type arr[i32,151936] = [1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, ...
llama_model_loader: - kv 21: tokenizer.ggml.merges arr[str,151387] = ["Ġ Ġ", "ĠĠ ĠĠ", "i n", "Ġ t",...
llama_model_loader: - kv 22: tokenizer.ggml.eos_token_id u32 = 151645
llama_model_loader: - kv 23: tokenizer.ggml.padding_token_id u32 = 151654
llama_model_loader: - kv 24: tokenizer.ggml.add_bos_token bool = false
llama_model_loader: - kv 25: tokenizer.chat_template str = {%- if tools %}\n {{- '<|im_start|>...
llama_model_loader: - kv 26: general.quantization_version u32 = 2
llama_model_loader: - kv 27: general.file_type u32 = 2
llama_model_loader: - kv 28: quantize.imatrix.file str = Qwen3-0.6B-GGUF/imatrix_unsloth.dat
llama_model_loader: - kv 29: quantize.imatrix.dataset str = unsloth_calibration_Qwen3-0.6B.txt
llama_model_loader: - kv 30: quantize.imatrix.entries_count u32 = 196
llama_model_loader: - kv 31: quantize.imatrix.chunks_count u32 = 688
llama_model_loader: - type f32: 113 tensors
llama_model_loader: - type q4_0: 193 tensors
llama_model_loader: - type q4_1: 3 tensors
llama_model_loader: - type q6_K: 1 tensors
print_info: file format = GGUF V3 (latest)
print_info: file type = Q4_0
print_info: file size = 358.78 MiB (5.05 BPW)
load: printing all EOG tokens:
load: - 151643 ('<|endoftext|>')
load: - 151645 ('<|im_end|>')
load: - 151662 ('<|fim_pad|>')
load: - 151663 ('<|repo_name|>')
load: - 151664 ('<|file_sep|>')
load: special tokens cache size = 26
load: token to piece cache size = 0.9311 MB
print_info: arch = qwen3
print_info: vocab_only = 0
print_info: n_ctx_train = 40960
print_info: n_embd = 1024
print_info: n_layer = 28
print_info: n_head = 16
print_info: n_head_kv = 8
print_info: n_rot = 128
print_info: n_swa = 0
print_info: is_swa_any = 0
print_info: n_embd_head_k = 128
print_info: n_embd_head_v = 128
print_info: n_gqa = 2
print_info: n_embd_k_gqa = 1024
print_info: n_embd_v_gqa = 1024
print_info: f_norm_eps = 0.0e+00
print_info: f_norm_rms_eps = 1.0e-06
print_info: f_clamp_kqv = 0.0e+00
print_info: f_max_alibi_bias = 0.0e+00
print_info: f_logit_scale = 0.0e+00
print_info: f_attn_scale = 0.0e+00
print_info: n_ff = 3072
print_info: n_expert = 0
print_info: n_expert_used = 0
print_info: causal attn = 1
print_info: pooling type = -1
print_info: rope type = 2
print_info: rope scaling = linear
print_info: freq_base_train = 1000000.0
print_info: freq_scale_train = 1
print_info: n_ctx_orig_yarn = 40960
print_info: rope_finetuned = unknown
print_info: model type = 0.6B
print_info: model params = 596.05 M
print_info: general.name = Qwen3-0.6B
print_info: vocab type = BPE
print_info: n_vocab = 151936
print_info: n_merges = 151387
print_info: BOS token = 11 ','
print_info: EOS token = 151645 '<|im_end|>'
print_info: EOT token = 151645 '<|im_end|>'
print_info: PAD token = 151654 '<|vision_pad|>'
print_info: LF token = 198 'Ċ'
print_info: FIM PRE token = 151659 '<|fim_prefix|>'
print_info: FIM SUF token = 151661 '<|fim_suffix|>'
print_info: FIM MID token = 151660 '<|fim_middle|>'
print_info: FIM PAD token = 151662 '<|fim_pad|>'
print_info: FIM REP token = 151663 '<|repo_name|>'
print_info: FIM SEP token = 151664 '<|file_sep|>'
print_info: EOG token = 151643 '<|endoftext|>'
print_info: EOG token = 151645 '<|im_end|>'
print_info: EOG token = 151662 '<|fim_pad|>'
print_info: EOG token = 151663 '<|repo_name|>'
print_info: EOG token = 151664 '<|file_sep|>'
print_info: max token length = 256
load_tensors: loading model tensors, this can take a while... (mmap = true)
load_tensors: offloading 28 repeating layers to GPU
load_tensors: offloading output layer to GPU
load_tensors: offloaded 29/29 layers to GPU
load_tensors: CANN0 model buffer size = 231.44 MiB
load_tensors: CPU_Mapped model buffer size = 144.24 MiB
...................................................................
llama_context: constructing llama_context
llama_context: n_seq_max = 9
llama_context: n_ctx = 16384
llama_context: n_ctx_per_seq = 1820
llama_context: n_batch = 2048
llama_context: n_ubatch = 512
llama_context: causal_attn = 1
llama_context: flash_attn = 0
llama_context: kv_unified = false
llama_context: freq_base = 1000000.0
llama_context: freq_scale = 1
llama_context: n_ctx_per_seq (1820) < n_ctx_train (40960) -- the full capacity of the model will not be utilized
ggml_backend_cann_context: device 0 async operator submission is OFF
ggml_backend_cann_context: LLAMA_SET_ROWS is OFF
ggml_backend_cann_context: CANN Graph currently only supports execution when LLAMA_SET_ROWS is ON. Falling back to eager mode.
llama_context: CANN_Host output buffer size = 5.22 MiB
llama_kv_cache: CANN0 KV buffer size = 1795.50 MiB
llama_kv_cache: size = 1795.50 MiB ( 1824 cells, 28 layers, 9/9 seqs), K (f16): 897.75 MiB, V (f16): 897.75 MiB
llama_context: CANN0 compute buffer size = 77.16 MiB
llama_context: CANN_Host compute buffer size = 298.75 MiB
llama_context: graph nodes = 1098
llama_context: graph splits = 9
common_init_from_params: added <|endoftext|> logit bias = -inf
common_init_from_params: added <|im_end|> logit bias = -inf
common_init_from_params: added <|fim_pad|> logit bias = -inf
common_init_from_params: added <|repo_name|> logit bias = -inf
common_init_from_params: added <|file_sep|> logit bias = -inf
common_init_from_params: setting dry_penalty_last_n to ctx_size = 16416
common_init_from_params: warming up the model with an empty run - please wait ... (--no-warmup to disable)
new_pool_for_device: device 0 use vmm pool
No new questions so proceed with build-in defaults.
main: initializing samplers with different RNG seeds, starting from -1
main: Simulating parallel requests from clients:
main: n_parallel = 8, n_sequences = 128, cont_batching = 1, system tokens = 273
Processing requests ...
main: clearing the KV cache
Client 0, seq 0, junk = 9, prompt = 515, started decoding ...
Client 1, seq 1, junk = 6, prompt = 426, started decoding ...
Client 2, seq 2, junk = 0, prompt = 295, started decoding ...
Client 3, seq 3, junk = 8, prompt = 474, started decoding ...
Client 4, seq 4, junk = 5, prompt = 415, started decoding ...
Client 5, seq 5, junk = 4, prompt = 391, started decoding ...
Client 6, seq 6, junk = 6, prompt = 457, started decoding ...
Client 7, seq 7, junk = 4, prompt = 398, started decoding ...
Client 0, seq 0/128, prompt 515 t, response 21 t, time 23.24 s, speed 23.06 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present to the present moment.
Client 0, seq 8, junk = 3, prompt = 364, started decoding ...
Client 6, seq 6/128, prompt 457 t, response 32 t, time 32.32 s, speed 15.13 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat and cook it for a few minutes.
Client 6, seq 9, junk = 5, prompt = 416, started decoding ...
Client 5, seq 5/128, prompt 391 t, response 34 t, time 35.52 s, speed 11.96 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then lower the heat to a low heat. This is the classic steak cooking method.
Client 5, seq 10, junk = 7, prompt = 455, started decoding ...
Client 7, seq 7/128, prompt 398 t, response 38 t, time 39.72 s, speed 10.98 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to practice and to keep learning.
Client 7, seq 11, junk = 1, prompt = 317, started decoding ...
Client 3, seq 3/128, prompt 474 t, response 39 t, time 41.23 s, speed 12.44 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, listen to music, and take lessons from a professional musician. It is important to practice and to practice every day.
Client 3, seq 12, junk = 3, prompt = 370, started decoding ...
Client 0, seq 8/128, prompt 364 t, response 18 t, time 20.39 s, speed 18.73 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live a meaningful life and to find joy in life.
Client 0, seq 13, junk = 8, prompt = 515, started decoding ...
Client 4, seq 4/128, prompt 415 t, response 45 t, time 48.83 s, speed 9.42 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to the Google job board, and you can also get a job through Google's internal recruitment process.
Client 4, seq 14, junk = 1, prompt = 310, started decoding ...
Client 1, seq 1/128, prompt 426 t, response 53 t, time 56.14 s, speed 8.53 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's employee placement services.
Client 1, seq 15, junk = 5, prompt = 421, started decoding ...
Client 5, seq 10/128, prompt 455 t, response 23 t, time 26.01 s, speed 18.38 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to find joy in the simple things in life.
Client 5, seq 16, junk = 0, prompt = 287, started decoding ...
Client 0, seq 13/128, prompt 515 t, response 45 t, time 40.19 s, speed 13.93 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also get a job through Google's own job market.
Client 0, seq 17, junk = 8, prompt = 472, started decoding ...
Client 3, seq 12/128, prompt 370 t, response 49 t, time 45.60 s, speed 9.19 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.
Client 3, seq 18, junk = 0, prompt = 284, started decoding ...
Client 7, seq 11/128, prompt 317 t, response 58 t, time 54.09 s, speed 6.93 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see the future. I would like to see the future in a way that allows me to make a difference in the world. I would like to be able to see the future in a way that allows me to make a difference in the world.
Client 7, seq 19, junk = 8, prompt = 501, started decoding ...
Client 0, seq 17/128, prompt 472 t, response 13 t, time 12.51 s, speed 38.78 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment.
Client 0, seq 20, junk = 1, prompt = 311, started decoding ...
Client 2, seq 2/128, prompt 295 t, response 102 t, time 98.54 s, speed 4.03 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: The Special Theory of Relativity is a theory that describes the relationship between space and time. It states that the laws of physics are the same in all inertial frames of reference. This means that the speed of light is constant in a given frame of reference, and that the laws of physics are the same in all inertial frames of reference. The theory was developed by Einstein in 1905. It is a fundamental part of modern physics and is used in many areas of science and technology.
Client 6, seq 9/128, prompt 416 t, response 69 t, time 66.30 s, speed 7.32 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It states that time, length, and mass are all affected by the speed of an object in motion. The theory was developed by Einstein and is one of the most important theories in physics.
Client 2, seq 21, junk = 9, prompt = 502, started decoding ...
Client 6, seq 22, junk = 3, prompt = 366, started decoding ...
Client 1, seq 15/128, prompt 421 t, response 50 t, time 46.71 s, speed 10.08 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at speeds close to the speed of light. It is a fundamental theory of physics and is one of the most important theories in physics.
Client 1, seq 23, junk = 8, prompt = 477, started decoding ...
Client 4, seq 14/128, prompt 310 t, response 98 t, time 84.58 s, speed 4.82 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho. It is a classic book about the journey of a person who starts with a dream and works to achieve it. It is a story about the journey of a person who starts with a dream and works to achieve it. It is a story about the journey of a person who starts with a dream and works to achieve it. It is a story about the journey of a person who starts with a dream and works to achieve it.
Client 4, seq 24, junk = 0, prompt = 285, started decoding ...
Client 2, seq 21/128, prompt 502 t, response 42 t, time 37.11 s, speed 14.66 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program.
Client 2, seq 25, junk = 1, prompt = 313, started decoding ...
Client 1, seq 23/128, prompt 477 t, response 41 t, time 35.00 s, speed 14.80 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel used for herding and grazing in the Andes region of South America. They are known for their strong and powerful bodies and their ability to run very fast.
Client 1, seq 26, junk = 0, prompt = 290, started decoding ...
Client 7, seq 19/128, prompt 501 t, response 57 t, time 52.70 s, speed 10.59 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, listen to music, and take lessons from a teacher. It's important to start with a simple melody and gradually increase the complexity. Also, it's helpful to practice with a friend or a music teacher.
Client 7, seq 27, junk = 4, prompt = 381, started decoding ...
Client 3, seq 18/128, prompt 284 t, response 74 t, time 67.30 s, speed 5.32 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is a philosophical question that varies across different cultures and religions. It is a question that has been asked by humans for thousands of years. The answer can vary depending on the person's beliefs and experiences. It is a question that can be answered in many ways, and it is a personal question that depends on the individual's own beliefs and experiences.
Client 3, seq 28, junk = 7, prompt = 462, started decoding ...
Client 6, seq 22/128, prompt 366 t, response 75 t, time 67.50 s, speed 6.53 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, find a good teacher, and practice with a good instrument. It's important to practice and to practice every day. Also, it's important to find a good way to practice, such as using a piano or a keyboard. The best way to learn is to practice and to practice every day.
Client 6, seq 29, junk = 1, prompt = 307, started decoding ...
Client 3, seq 28/128, prompt 462 t, response 19 t, time 17.89 s, speed 26.89 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 3, seq 30, junk = 8, prompt = 521, started decoding ...
Client 5, seq 16/128, prompt 287 t, response 128 t, time 114.70 s, speed 3.62 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then lower the heat to medium heat. The steak should be cooked to a certain level of doneness, typically between 100-130 degrees Fahrenheit. The cooking time depends on the thickness of the steak and the desired level of doneness. The steak should be cooked to a certain level of doneness, typically between 100-130 degrees Fahrenheit. The cooking time depends on the thickness of the steak and the desired level of doneness. The steak should be cooked to a certain level of doneness, typically between 1
Client 5, seq 31, junk = 5, prompt = 399, started decoding ...
Client 0, seq 20/128, prompt 311 t, response 98 t, time 90.48 s, speed 4.52 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho. It is a classic book about the journey of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it.
Client 0, seq 32, junk = 5, prompt = 417, started decoding ...
Client 2, seq 25/128, prompt 313 t, response 54 t, time 52.69 s, speed 6.97 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel used for herding and grazing in the Andes region of South America. They are known for their strong and powerful muscles and their ability to run long distances. They are also known for their unique social behavior and communication methods.
Client 2, seq 33, junk = 5, prompt = 425, started decoding ...
Client 6, seq 29/128, prompt 307 t, response 39 t, time 37.09 s, speed 9.33 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to find joy in every moment. It is a simple and profound concept that has been spoken about in many religions and philosophies.
Client 6, seq 34, junk = 7, prompt = 451, started decoding ...
Client 7, seq 27/128, prompt 381 t, response 63 t, time 59.00 s, speed 7.53 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's employee placement program. You can also apply to Google's recruitment process.
Client 7, seq 35, junk = 0, prompt = 284, started decoding ...
Client 0, seq 32/128, prompt 417 t, response 29 t, time 26.80 s, speed 16.64 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily, speak with native speakers, and immerse yourself in the language culture.
Client 0, seq 36, junk = 8, prompt = 500, started decoding ...
Client 3, seq 30/128, prompt 521 t, response 49 t, time 46.30 s, speed 12.31 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.
Client 3, seq 37, junk = 0, prompt = 290, started decoding ...
Client 6, seq 34/128, prompt 451 t, response 19 t, time 19.59 s, speed 23.99 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 6, seq 38, junk = 4, prompt = 383, started decoding ...
Client 1, seq 26/128, prompt 290 t, response 100 t, time 93.20 s, speed 4.18 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail.
Client 2, seq 33/128, prompt 425 t, response 46 t, time 42.80 s, speed 11.00 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel used for herding and transporting goods. They are known for their strong and powerful legs and their ability to run very fast. They are also known for their ability to carry heavy loads.
Client 1, seq 39, junk = 9, prompt = 520, started decoding ...
Client 2, seq 40, junk = 5, prompt = 424, started decoding ...
Client 0, seq 36/128, prompt 500 t, response 19 t, time 22.09 s, speed 23.49 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present to others.
Client 0, seq 41, junk = 7, prompt = 460, started decoding ...
Client 5, seq 31/128, prompt 399 t, response 65 t, time 63.39 s, speed 7.32 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are animals that live in the Andes mountains. They are known for their wool and their ability to carry heavy loads. They are also known for their strong and powerful legs and their ability to run very fast. They are also known for their ability to climb trees and for their ability to carry heavy loads.
Client 5, seq 42, junk = 1, prompt = 315, started decoding ...
Client 4, seq 24/128, prompt 285 t, response 128 t, time 122.60 s, speed 3.37 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree or a bachelor's degree. You can apply for internships or part-time jobs at Google. You can also get a job through Google's career center. You can also get a job through Google's job board. You can also get a job through Google's internship program. You can also get a job through Google's employee portal. You can also get a job through Google's job search. You can also get a job through Google's job board. You can also get a job through Google's career center. You can also get a job through Google's employee portal.
User
Client 4, seq 43, junk = 8, prompt = 491, started decoding ...
Client 1, seq 39/128, prompt 520 t, response 30 t, time 30.20 s, speed 18.21 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see the future. I would like to see the future, and I would like to see the future.
Client 1, seq 44, junk = 0, prompt = 285, started decoding ...
Client 2, seq 40/128, prompt 424 t, response 46 t, time 43.59 s, speed 10.78 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel used for herding and grazing. They are also known as "camel in the desert" and "camel in the field". They are native to the Andes Mountains in South America.
Client 2, seq 45, junk = 8, prompt = 490, started decoding ...
Client 6, seq 38/128, prompt 383 t, response 75 t, time 68.31 s, speed 6.70 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds. It states that objects in motion have a relativistic effect on their speed. The theory was developed by Albert Einstein in 1905. It is a fundamental part of modern physics and is used in many areas of science and technology.
Client 6, seq 46, junk = 7, prompt = 470, started decoding ...
Client 2, seq 45/128, prompt 490 t, response 21 t, time 19.70 s, speed 25.94 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily and immerse yourself in it.
Client 2, seq 47, junk = 2, prompt = 339, started decoding ...
Client 0, seq 41/128, prompt 460 t, response 67 t, time 61.00 s, speed 8.64 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, "To Kill a Mockingbird" by Harper Lee, and "The Grapes of Wrath" by John Steinbeck. These books are all classics and offer a great experience.
Client 0, seq 48, junk = 5, prompt = 398, started decoding ...
Client 7, seq 35/128, prompt 284 t, response 98 t, time 92.71 s, speed 4.12 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho. It is a classic book about the journey of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it.
Client 7, seq 49, junk = 5, prompt = 414, started decoding ...
Client 4, seq 43/128, prompt 491 t, response 47 t, time 45.30 s, speed 11.88 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply for internships or full-time positions at Google. You can also look for opportunities in other companies or organizations.
Client 4, seq 50, junk = 9, prompt = 528, started decoding ...
Client 3, seq 37/128, prompt 290 t, response 100 t, time 95.80 s, speed 4.07 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail.
Client 3, seq 51, junk = 7, prompt = 466, started decoding ...
Client 5, seq 42/128, prompt 315 t, response 86 t, time 80.09 s, speed 5.01 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's campus recruitment program. You can also apply to Google's employee placement program. You can also apply to Google's job search program. You can also apply to Google's online job platform.
Client 5, seq 52, junk = 1, prompt = 327, started decoding ...
Client 6, seq 46/128, prompt 470 t, response 28 t, time 32.30 s, speed 15.42 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then to cook it with a good amount of oil.
Client 6, seq 53, junk = 8, prompt = 465, started decoding ...
Client 6, seq 53/128, prompt 465 t, response 26 t, time 20.99 s, speed 23.39 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily and immerse yourself in the language as much as possible.
Client 6, seq 54, junk = 6, prompt = 448, started decoding ...
Client 4, seq 50/128, prompt 528 t, response 49 t, time 45.30 s, speed 12.74 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.
Client 4, seq 55, junk = 3, prompt = 374, started decoding ...
Client 3, seq 51/128, prompt 466 t, response 39 t, time 37.09 s, speed 13.61 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, find a good teacher, and practice with a good instrument. It's important to practice and to practice every day.
Client 3, seq 56, junk = 1, prompt = 313, started decoding ...
Client 1, seq 44/128, prompt 285 t, response 99 t, time 93.50 s, speed 4.11 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree or a bachelor's degree. You can apply for internships or part-time jobs. You can also get a job through Google's career center. You can also get a job through Google's job board. You can also get a job through Google's online courses. You can also get a job through Google's online courses and other online resources. You can also get a job through Google's online courses and other online resources.
Client 1, seq 57, junk = 8, prompt = 487, started decoding ...
Client 7, seq 49/128, prompt 414 t, response 60 t, time 59.10 s, speed 8.02 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel that live in the Andes Mountains. They are known for their ability to climb trees and their strong legs. They are also known for their ability to carry heavy loads. They are a popular source of meat and are often found in the Andes region.
Client 7, seq 58, junk = 4, prompt = 405, started decoding ...
Client 0, seq 48/128, prompt 398 t, response 77 t, time 72.70 s, speed 6.53 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho, "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, "The Lord of the Rings" by J.R.R. Tolkien, and "The Hobbit" by J.R.R. Tolkien. These books are all highly recommended for reading.
Client 0, seq 59, junk = 8, prompt = 499, started decoding ...
Client 4, seq 55/128, prompt 374 t, response 36 t, time 33.10 s, speed 12.39 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then let it rest for a few minutes. This is a common method used in many restaurants.
Client 4, seq 60, junk = 0, prompt = 290, started decoding ...
Client 1, seq 57/128, prompt 487 t, response 34 t, time 29.89 s, speed 17.43 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel used for herding and grazing. They are known for their ability to walk on water and their ability to carry heavy loads.
Client 1, seq 61, junk = 0, prompt = 286, started decoding ...
Client 2, seq 47/128, prompt 339 t, response 103 t, time 96.80 s, speed 4.57 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is also known as the theory of relativity. The theory states that the speed of light is constant in a vacuum, and that objects moving at high speeds relative to each other experience time dilation. The theory also states that the laws of physics are the same in all inertial frames of reference. The theory is one of the most important theories in physics.
Client 2, seq 62, junk = 4, prompt = 385, started decoding ...
Client 5, seq 52/128, prompt 327 t, response 86 t, time 78.29 s, speed 5.28 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to start with a basic piano and practice regularly. You can use online resources, such as YouTube tutorials, to learn the basics. It's important to practice consistently and to take notes on your progress. Also, consider joining a piano club or community to get support and practice with others. The key is to start with the basics and gradually build up to more complex skills.
Client 5, seq 63, junk = 1, prompt = 305, started decoding ...
Client 7, seq 58/128, prompt 405 t, response 58 t, time 49.90 s, speed 9.28 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to start with a simple piece and gradually increase the complexity. Also, it's helpful to practice with a friend or a music teacher.
Client 7, seq 64, junk = 1, prompt = 314, started decoding ...
Client 3, seq 56/128, prompt 313 t, response 66 t, time 59.59 s, speed 6.36 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see the future. I would like to be able to predict the future. I would like to know the future in advance. I would like to know the future in advance, and I would like to know the future in advance, and I would like to know the future in advance.
Client 3, seq 65, junk = 2, prompt = 336, started decoding ...
Client 5, seq 63/128, prompt 305 t, response 20 t, time 18.51 s, speed 17.56 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live a good life and to make a good contribution to society.
Client 5, seq 66, junk = 7, prompt = 464, started decoding ...
Client 2, seq 62/128, prompt 385 t, response 31 t, time 29.50 s, speed 14.10 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to be able to fly. I would like to fly to the top of the world. I would like to fly to the sky.
Client 2, seq 67, junk = 6, prompt = 442, started decoding ...
Client 6, seq 54/128, prompt 448 t, response 92 t, time 85.81 s, speed 6.29 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's campus programs, which are open to all students. You can also apply to Google's online job portals, which are available for all individuals. You can also apply to Google's internship programs, which are available for students and professionals. You can also apply to Google's employee placement services, which are available for all individuals.
Client 6, seq 68, junk = 7, prompt = 474, started decoding ...
Client 0, seq 59/128, prompt 499 t, response 69 t, time 63.39 s, speed 8.96 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds. It states that objects in motion have a relativistic mass and that time and space are affected by the speed of the object. The theory was developed by Einstein and is one of the most important theories in physics.
Client 0, seq 69, junk = 5, prompt = 405, started decoding ...
Client 2, seq 67/128, prompt 442 t, response 20 t, time 18.70 s, speed 24.71 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live a meaningful life and to find joy and purpose in life.
Client 2, seq 70, junk = 8, prompt = 474, started decoding ...
Client 1, seq 61/128, prompt 286 t, response 71 t, time 65.11 s, speed 5.48 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel used for herding and grazing in the Andes region of South America. They are known for their ability to walk on water and their ability to carry heavy loads. They are also known for their intelligence and adaptability. They are a symbol of the Andes culture and are often seen in Andes folklore.
Client 1, seq 71, junk = 8, prompt = 488, started decoding ...
Client 5, seq 66/128, prompt 464 t, response 46 t, time 42.40 s, speed 12.03 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply for internships or full-time positions at Google. You can also look for opportunities in the tech industry.
Client 5, seq 72, junk = 4, prompt = 390, started decoding ...
Client 3, seq 65/128, prompt 336 t, response 58 t, time 53.81 s, speed 7.32 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then lower the heat to a low heat. This is called "steaming" and it's a popular method. The steak should be cooked to the desired level of doneness and should be seasoned properly.
Client 3, seq 73, junk = 9, prompt = 528, started decoding ...
Client 4, seq 60/128, prompt 290 t, response 100 t, time 92.39 s, speed 4.22 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail.
Client 4, seq 74, junk = 3, prompt = 363, started decoding ...
Client 0, seq 69/128, prompt 405 t, response 46 t, time 43.09 s, speed 10.47 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Catcher in the Rye" by J.D. Salinger and "To Kill a Mockingbird" by Harper Lee. These books are both classic fiction and have a lot of depth and meaning.
Client 6, seq 68/128, prompt 474 t, response 48 t, time 45.69 s, speed 11.42 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to start with a simple piece and gradually increase the complexity of the music you learn.
Client 0, seq 75, junk = 5, prompt = 413, started decoding ...
Client 6, seq 76, junk = 3, prompt = 372, started decoding ...
Client 1, seq 71/128, prompt 488 t, response 30 t, time 31.20 s, speed 16.60 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: llamas are animals that live in the Andes mountains in South America. They are known for their wool and their ability to carry heavy loads.
Client 1, seq 77, junk = 4, prompt = 383, started decoding ...
Client 3, seq 73/128, prompt 528 t, response 19 t, time 21.89 s, speed 24.99 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 3, seq 78, junk = 2, prompt = 329, started decoding ...
Client 5, seq 72/128, prompt 390 t, response 35 t, time 36.90 s, speed 11.52 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat. This is a classic method used in many restaurants.
Client 5, seq 79, junk = 6, prompt = 447, started decoding ...
Client 2, seq 70/128, prompt 474 t, response 58 t, time 58.50 s, speed 9.09 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, and "To Kill a Mockingbird" by Harper Lee. These books are all classics and have a lot of depth and meaning.
Client 2, seq 80, junk = 3, prompt = 360, started decoding ...
Client 0, seq 75/128, prompt 413 t, response 24 t, time 27.20 s, speed 16.07 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see the future. I would like to be able to change the world.
Client 0, seq 81, junk = 1, prompt = 299, started decoding ...
Client 1, seq 77/128, prompt 383 t, response 25 t, time 26.11 s, speed 15.63 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily and immerse yourself in it as much as possible.
Client 1, seq 82, junk = 4, prompt = 382, started decoding ...
Client 7, seq 64/128, prompt 314 t, response 109 t, time 106.20 s, speed 3.98 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are a type of camel that are native to South America. They are known for their ability to walk on water and their ability to carry heavy loads. They are also known for their ability to climb trees and their ability to swim in water. They are also known for their ability to carry heavy loads and their ability to walk on water. They are also known for their ability to carry heavy loads and their ability to walk on water. They are also known for their ability to carry heavy loads and their ability to walk on water.
Client 7, seq 83, junk = 2, prompt = 336, started decoding ...
Client 6, seq 76/128, prompt 372 t, response 49 t, time 48.58 s, speed 8.67 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.
Client 6, seq 84, junk = 0, prompt = 290, started decoding ...
Client 5, seq 79/128, prompt 447 t, response 37 t, time 34.99 s, speed 13.83 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the ability to see into the future. I would like to be able to travel to other dimensions. I would like to be able to change the future.
Client 5, seq 85, junk = 7, prompt = 441, started decoding ...
Client 0, seq 81/128, prompt 299 t, response 40 t, time 35.49 s, speed 9.55 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to find joy in every moment. It is a simple and profound concept that has been practiced by many people across different cultures and religions.
Client 0, seq 86, junk = 3, prompt = 367, started decoding ...
Client 2, seq 80/128, prompt 360 t, response 60 t, time 51.49 s, speed 8.16 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Lamas are a type of llama that are known for their unique behavior and are often found in the Andes region of South America. They are also known as "the llamas of the Andes" and are considered to be one of the most important species of llamas in the world.
Client 2, seq 87, junk = 8, prompt = 495, started decoding ...
Client 5, seq 85/128, prompt 441 t, response 44 t, time 35.70 s, speed 13.58 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to the Google job board, and you can also get a job through Google's internal programs.
Client 5, seq 88, junk = 5, prompt = 426, started decoding ...
Client 4, seq 74/128, prompt 363 t, response 106 t, time 97.30 s, speed 4.82 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, especially at home. You can start with simple songs and gradually increase the complexity. It's important to practice every day, even if it's just for a few minutes. You can also use online resources, such as YouTube tutorials or music apps, to help you learn. Consistency is key, and it's important to take it step by step. Also, you can practice with a friend or a music teacher to get feedback and improve your skills.
Client 4, seq 89, junk = 8, prompt = 481, started decoding ...
Client 0, seq 86/128, prompt 367 t, response 40 t, time 34.79 s, speed 11.70 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then reduce the heat to a lower temperature. This is called "steak cooking" and is a popular method.
Client 0, seq 90, junk = 6, prompt = 454, started decoding ...
Client 3, seq 78/128, prompt 329 t, response 104 t, time 92.69 s, speed 4.67 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Lamas are a type of animal that live in the Andes mountains. They are known for their beautiful appearance and are often seen in the Andes. They are also known as the "Llama of the Andes" and are a symbol of the Andes region. They are also known as the "Llama of the Andes" and are a symbol of the Andes region. They are also known as the "Lama of the Andes" and are a symbol of the Andes region.
Client 3, seq 91, junk = 3, prompt = 364, started decoding ...
Client 1, seq 82/128, prompt 382 t, response 90 t, time 78.18 s, speed 6.04 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Lamas are also called "Llama" in some countries. They are a type of animal that live in the Andes mountains. They are known for their unique way of life and their ability to live in different environments. They are also called "Llama" in some countries. They are a type of animal that live in the Andes mountains. They are known for their unique way of life and their ability to live in different environments.
Client 1, seq 92, junk = 4, prompt = 381, started decoding ...
Client 5, seq 88/128, prompt 426 t, response 19 t, time 21.30 s, speed 20.89 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to be able to fly. That's a superpower I could have.
Client 5, seq 93, junk = 4, prompt = 380, started decoding ...
Client 2, seq 87/128, prompt 495 t, response 45 t, time 43.79 s, speed 12.33 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also get a job through Google's own job market.
Client 2, seq 94, junk = 9, prompt = 503, started decoding ...
Client 0, seq 90/128, prompt 454 t, response 26 t, time 26.30 s, speed 18.25 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily and to immerse yourself in it as much as possible.
Client 0, seq 95, junk = 9, prompt = 507, started decoding ...
Client 4, seq 89/128, prompt 481 t, response 42 t, time 40.20 s, speed 13.01 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program.
Client 4, seq 96, junk = 1, prompt = 313, started decoding ...
Client 3, seq 91/128, prompt 364 t, response 37 t, time 35.50 s, speed 11.30 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the ability to see into the future. I would like to be able to travel to other planets. I would like to be able to communicate with aliens.
Client 3, seq 97, junk = 5, prompt = 407, started decoding ...
Client 7, seq 83/128, prompt 336 t, response 128 t, time 112.19 s, speed 4.14 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho, "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, "The Lord of the Rings" by J.R.R. Tolkien, "The Hobbit" by J.R.R. Tolkien, "The Grapes of Wrath" by John Steinbeck, "The Catcher in the Rye" by J.D. Salinger, "The Lord of the Rings" by J.R.R. Tolkien, "The Hobbit" by J.R.R. Tolkien, "The Grapes of Wrath" by John
Client 7, seq 98, junk = 3, prompt = 339, started decoding ...
Client 0, seq 95/128, prompt 507 t, response 32 t, time 28.11 s, speed 19.18 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat and cook it for a few minutes.
Client 0, seq 99, junk = 7, prompt = 464, started decoding ...
Client 6, seq 84/128, prompt 290 t, response 128 t, time 112.90 s, speed 3.70 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens
Client 6, seq 100, junk = 7, prompt = 461, started decoding ...
Client 2, seq 94/128, prompt 503 t, response 55 t, time 48.20 s, speed 11.58 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, and "To Kill a Mockingbird" by Harper Lee. These books are all classics and offer a great experience.
Client 2, seq 101, junk = 8, prompt = 482, started decoding ...
Client 0, seq 99/128, prompt 464 t, response 19 t, time 18.10 s, speed 26.68 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 0, seq 102, junk = 7, prompt = 460, started decoding ...
Client 4, seq 96/128, prompt 313 t, response 42 t, time 39.30 s, speed 9.03 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily, speak with native speakers, and immerse yourself in the language. It's important to take it slowly and to be patient with yourself.
Client 4, seq 103, junk = 6, prompt = 442, started decoding ...
Client 6, seq 100/128, prompt 461 t, response 12 t, time 14.80 s, speed 31.96 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live a meaningful life.
Client 6, seq 104, junk = 4, prompt = 373, started decoding ...
Client 5, seq 93/128, prompt 380 t, response 73 t, time 68.41 s, speed 6.62 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to start with a basic piano and practice regularly. You can use online resources such as YouTube tutorials or apps like "Piano Lessons" to help you learn. It's important to practice every day and to take it step by step. Also, you can find piano lessons online or at local music schools.
Client 5, seq 105, junk = 5, prompt = 410, started decoding ...
Client 7, seq 98/128, prompt 339 t, response 42 t, time 41.00 s, speed 9.29 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then lower the heat to medium heat and cook it for a few minutes. This is a classic method used in many restaurants.
Client 7, seq 106, junk = 1, prompt = 310, started decoding ...
Client 6, seq 104/128, prompt 373 t, response 19 t, time 18.00 s, speed 21.77 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 6, seq 107, junk = 5, prompt = 427, started decoding ...
Client 4, seq 103/128, prompt 442 t, response 25 t, time 25.60 s, speed 18.24 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily, speak it, and immerse yourself in it.
Client 4, seq 108, junk = 0, prompt = 284, started decoding ...
Client 0, seq 102/128, prompt 460 t, response 31 t, time 31.99 s, speed 15.35 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium and cook it for a few minutes.
Client 0, seq 109, junk = 2, prompt = 330, started decoding ...
Client 3, seq 97/128, prompt 407 t, response 69 t, time 66.21 s, speed 7.19 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho, "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, "The Lord of the Rings" by J.R.R. Tolkien, and "Pride and Prejudice" by Jane Austen.
Client 3, seq 110, junk = 7, prompt = 465, started decoding ...
Client 6, seq 107/128, prompt 427 t, response 19 t, time 19.49 s, speed 22.88 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 6, seq 111, junk = 8, prompt = 482, started decoding ...
Client 2, seq 101/128, prompt 482 t, response 53 t, time 53.29 s, speed 10.04 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's employee placement services.
Client 2, seq 112, junk = 6, prompt = 440, started decoding ...
Client 1, seq 92/128, prompt 381 t, response 117 t, time 112.30 s, speed 4.43 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are the most common type of animal in the Americas. They are also known as "the camel in the clouds." They are used for herding and carrying goods. They are also known as "the camel in the clouds" and "the camel in the clouds." They are also known as "the camel in the clouds" and "the camel in the clouds." They are also known as "the camel in the clouds" and "the camel in the clouds." They are also known as "the camel in the clouds" and "the camel in the clouds."
Client 1, seq 113, junk = 9, prompt = 529, started decoding ...
Client 5, seq 105/128, prompt 410 t, response 53 t, time 51.39 s, speed 9.01 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job board, and you can also apply to Google's internship program. You can also apply to Google's campus recruitment program.
Client 5, seq 114, junk = 5, prompt = 422, started decoding ...
Client 0, seq 109/128, prompt 330 t, response 32 t, time 32.10 s, speed 11.28 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily, speak with native speakers, and immerse yourself in the language as much as possible.
Client 0, seq 115, junk = 2, prompt = 348, started decoding ...
Client 7, seq 106/128, prompt 310 t, response 45 t, time 45.90 s, speed 7.73 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live a good life and to make a good life. It is also about making a good life and living a good life. It is about living a good life and making a good life.
Client 7, seq 116, junk = 5, prompt = 400, started decoding ...
Client 2, seq 112/128, prompt 440 t, response 19 t, time 20.89 s, speed 21.97 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 2, seq 117, junk = 9, prompt = 518, started decoding ...
Client 1, seq 113/128, prompt 529 t, response 21 t, time 22.89 s, speed 24.02 t/s, cache miss 0
Input: If you could have any superpower, what would it be?
Response: I would like to have the power to see the future. That's what I want to do.
Client 1, seq 118, junk = 5, prompt = 425, started decoding ...
Client 3, seq 110/128, prompt 465 t, response 42 t, time 44.41 s, speed 11.42 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program.
Client 3, seq 119, junk = 5, prompt = 391, started decoding ...
Client 5, seq 114/128, prompt 422 t, response 21 t, time 24.50 s, speed 18.08 t/s, cache miss 0
Input: What is the best way to learn a new language?
Response: The best way to learn a new language is to practice it daily and to speak with native speakers.
Client 6, seq 111/128, prompt 482 t, response 39 t, time 42.41 s, speed 12.28 t/s, cache miss 0
Input: Tell me an interesting fact about llamas.
Response: Llamas are animals that live in the Andes mountains. They are known for their wool and their ability to run very fast. They are also known for their ability to climb trees.
Client 5, seq 120, junk = 4, prompt = 410, started decoding ...
Client 6, seq 121, junk = 1, prompt = 318, started decoding ...
Client 7, seq 116/128, prompt 400 t, response 19 t, time 23.60 s, speed 17.76 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live in the present moment and to be present with others.
Client 7, seq 122, junk = 9, prompt = 502, started decoding ...
Client 1, seq 118/128, prompt 425 t, response 18 t, time 20.19 s, speed 21.94 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is to live a good life and to live a good life.
Client 1, seq 123, junk = 4, prompt = 397, started decoding ...
Client 4, seq 108/128, prompt 284 t, response 74 t, time 75.70 s, speed 4.73 t/s, cache miss 0
Input: What is the meaning of life?
Response: The meaning of life is a philosophical question that varies across different cultures and religions. It is a question that has been asked by humans for thousands of years. The answer can vary depending on the person's beliefs and experiences. It is a question that can be answered in many ways, and it is a personal question that depends on the individual's own beliefs and experiences.
Client 4, seq 124, junk = 5, prompt = 417, started decoding ...
Client 0, seq 115/128, prompt 348 t, response 49 t, time 49.10 s, speed 8.08 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.
Client 0, seq 125, junk = 7, prompt = 458, started decoding ...
Client 2, seq 117/128, prompt 518 t, response 62 t, time 56.50 s, speed 10.26 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to start with a simple piece and gradually increase the complexity of the music you learn. Consistent practice and dedication will help you improve your skills over time.
Client 2, seq 126, junk = 7, prompt = 469, started decoding ...
Client 4, seq 124/128, prompt 417 t, response 32 t, time 25.99 s, speed 17.27 t/s, cache miss 0
Input: What is the best way to cook a steak?
Response: The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat and cook it for a few minutes.
Client 4, seq 127, junk = 5, prompt = 407, started decoding ...
Client 3, seq 119/128, prompt 391 t, response 59 t, time 52.99 s, speed 8.49 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also get a job through Google's own job market. You can also get a job through Google's online courses and certifications.
Client 7, seq 122/128, prompt 502 t, response 68 t, time 67.31 s, speed 8.47 t/s, cache miss 0
Input: I want to learn how to play the piano. What would be the best way to do it?
Response: The best way to learn how to play the piano is to practice regularly, listen to music, and take lessons from a professional musician. It's important to start with a simple melody and gradually build up to more complex pieces. Also, it's helpful to practice with a friend or a music teacher to get feedback and improve your skills.
Client 1, seq 123/128, prompt 397 t, response 70 t, time 73.90 s, speed 6.32 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It states that objects in motion have a relativistic mass and that time and length are affected by the speed of the object. The theory was developed by Albert Einstein in 1905.
Client 5, seq 120/128, prompt 410 t, response 78 t, time 82.79 s, speed 5.89 t/s, cache miss 0
Input: Are you familiar with the Special Theory of Relativity and can you explain it to me?
Response: Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is also known as the theory of relativity. The theory states that the speed of light in a vacuum is constant and the same for all observers, no matter their motion. This is one of the most important theories in physics.
Client 0, seq 125/128, prompt 458 t, response 51 t, time 64.20 s, speed 7.93 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree in computer science or a related field. You can apply for internships or full-time positions at Google. You can also look for opportunities through Google's job board or other job portals.
Client 6, seq 121/128, prompt 318 t, response 82 t, time 93.49 s, speed 4.28 t/s, cache miss 0
Input: How to get a job at Google?
Response: To get a job at Google, you need to have a degree or a certificate in computer science, and you can apply for a job through Google's job portal. You can also apply for internships or other positions through Google's job portal. You can also apply for internships or other positions through Google's job portal. You can also apply for internships or other positions through Google's job portal.
Client 2, seq 126/128, prompt 469 t, response 46 t, time 70.99 s, speed 7.25 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Catcher in the Rye" by J.D. Salinger and "To Kill a Mockingbird" by Harper Lee. These books are both classic fiction and have a lot of depth and meaning.
Client 4, seq 127/128, prompt 407 t, response 83 t, time 96.61 s, speed 5.07 t/s, cache miss 0
Input: Recommend some interesting books to read.
Response: I recommend "The Alchemist" by Paulo Coelho, "The Catcher in the Rye" by J.D. Salinger, "To Kill a Mockingbird" by Harper Lee, "The Lord of the Rings" by J.R.R. Tolkien, and "The Grapes of Wrath" by John Steinbeck. These books are all classics and offer a great range of stories and themes.
main: clearing the KV cache
run parameters as of 2025-12-04 09:52:05
main: n_parallel = 8, n_sequences = 128, cont_batching = 1, system tokens = 273
External prompt file: used built-in defaults
Model and path used: Qwen3-0.6B-Q4_0.gguf
Total prompt tokens: 52019, speed: 59.59 t/s
Total gen tokens: 6830, speed: 7.82 t/s
Total speed (AVG): speed: 67.41 t/s
Cache misses: 0
llama_perf_context_print: load time = 2483.67 ms
llama_perf_context_print: prompt eval time = 560641.74 ms / 58808 tokens ( 9.53 ms per token, 104.89 tokens per second)
llama_perf_context_print: eval time = 28979.60 ms / 41 runs ( 706.82 ms per token, 1.41 tokens per second)
llama_perf_context_print: total time = 872942.34 ms / 58849 tokens
llama_perf_context_print: graphs reused = 636
Author
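The per-prompt grouping in this comment can be reproduced from the raw log with a short script. The following is a minimal sketch, not the script actually used for this PR: it assumes each `Input:` line is followed by a single-line `Response:` line, as in the log above, and the function name `group_responses` and the log file path are hypothetical.

```python
import json
import re
import sys

# Assumed log shape (taken from the output above):
#   Input: <prompt text>
#   Response: <response text on one line>
INPUT_RE = re.compile(r"^Input:\s*(.*)$")
RESPONSE_RE = re.compile(r"^Response:\s*(.*)$")


def group_responses(log_text):
    """Group responses by prompt and count total and distinct responses."""
    groups = {}            # prompt -> list of responses, in log order
    current_prompt = None  # prompt of the most recent "Input:" line

    for raw_line in log_text.splitlines():
        line = raw_line.strip()

        match = INPUT_RE.match(line)
        if match:
            current_prompt = match.group(1)
            continue

        match = RESPONSE_RE.match(line)
        if match and current_prompt is not None:
            groups.setdefault(current_prompt, []).append(match.group(1))
            current_prompt = None  # wait for the next "Input:" line

    return {
        prompt: {
            "response_count": len(responses),
            "unique_response_count": len(set(responses)),
            "responses": responses,
        }
        for prompt, responses in groups.items()
    }


if __name__ == "__main__":
    # Hypothetical usage: python group_responses.py parallel.log
    with open(sys.argv[1], encoding="utf-8") as log_file:
        summary = group_responses(log_file.read())
    print(json.dumps(summary, indent=2, ensure_ascii=False))
```

With `--top-k 1` the sampling is greedy, so a `unique_response_count` well above 1 for the same prompt is a quick indicator of how much the outputs diverge across sequences.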
Responses grouped by prompt, so that the output is more clearly structured and easier to read:
{
"What is the meaning of life?": {
"response_count": 22,
"unique_response_count": 14,
"responses": [
"The meaning of life is to live in the present moment and to be present to the present moment.",
"The meaning of life is to live a meaningful life and to find joy in life.",
"The meaning of life is to live in the present moment and to find joy in the simple things in life.",
"The meaning of life is to live in the present moment.",
"The meaning of life is a philosophical question that varies across different cultures and religions. It is a question that has been asked by humans for thousands of years. The answer can vary depending on the person's beliefs and experiences. It is a question that can be answered in many ways, and it is a personal question that depends on the individual's own beliefs and experiences.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live in the present moment and to find joy in every moment. It is a simple and profound concept that has been spoken about in many religions and philosophies.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live in the present moment and to be present to others.",
"The meaning of life is to live a good life and to make a good contribution to society.",
"The meaning of life is to live a meaningful life and to find joy and purpose in life.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live in the present moment and to find joy in every moment. It is a simple and profound concept that has been practiced by many people across different cultures and religions.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live a meaningful life.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live a good life and to make a good life. It is also about making a good life and living a good life. It is about living a good life and making a good life.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live in the present moment and to be present with others.",
"The meaning of life is to live a good life and to live a good life.",
"The meaning of life is a philosophical question that varies across different cultures and religions. It is a question that has been asked by humans for thousands of years. The answer can vary depending on the person's beliefs and experiences. It is a question that can be answered in many ways, and it is a personal question that depends on the individual's own beliefs and experiences."
]
},
"What is the best way to cook a steak?": {
"response_count": 12,
"unique_response_count": 10,
"responses": [
"The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat and cook it for a few minutes.",
"The best way to cook a steak is to cook it over a high heat and then lower the heat to a low heat. This is the classic steak cooking method.",
"The best way to cook a steak is to cook it over a high heat and then lower the heat to medium heat. The steak should be cooked to a certain level of doneness, typically between 100-130 degrees Fahrenheit. The cooking time depends on the thickness of the steak and the desired level of doneness. The steak should be cooked to a certain level of doneness, typically between 100-130 degrees Fahrenheit. The cooking time depends on the thickness of the steak and the desired level of doneness. The steak should be cooked to a certain level of doneness, typically between 1",
"The best way to cook a steak is to cook it over a high heat and then to cook it with a good amount of oil.",
"The best way to cook a steak is to cook it over a high heat and then let it rest for a few minutes. This is a common method used in many restaurants.",
"The best way to cook a steak is to cook it over a high heat and then lower the heat to a low heat. This is called \"steaming\" and it's a popular method. The steak should be cooked to the desired level of doneness and should be seasoned properly.",
"The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat. This is a classic method used in many restaurants.",
"The best way to cook a steak is to cook it over a high heat and then reduce the heat to a lower temperature. This is called \"steak cooking\" and is a popular method.",
"The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat and cook it for a few minutes.",
"The best way to cook a steak is to cook it over a high heat and then lower the heat to medium heat and cook it for a few minutes. This is a classic method used in many restaurants.",
"The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium and cook it for a few minutes.",
"The best way to cook a steak is to cook it over a high heat and then reduce the heat to medium heat and cook it for a few minutes."
]
},
"I want to learn how to play the piano. What would be the best way to do it?": {
"response_count": 12,
"unique_response_count": 12,
"responses": [
"The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to practice and to keep learning.",
"The best way to learn how to play the piano is to practice regularly, listen to music, and take lessons from a professional musician. It is important to practice and to practice every day.",
"The best way to learn how to play the piano is to practice regularly, listen to music, and take lessons from a teacher. It's important to start with a simple melody and gradually increase the complexity. Also, it's helpful to practice with a friend or a music teacher.",
"The best way to learn how to play the piano is to practice regularly, find a good teacher, and practice with a good instrument. It's important to practice and to practice every day. Also, it's important to find a good way to practice, such as using a piano or a keyboard. The best way to learn is to practice and to practice every day.",
"The best way to learn how to play the piano is to practice regularly, find a good teacher, and practice with a good instrument. It's important to practice and to practice every day.",
"The best way to learn how to play the piano is to start with a basic piano and practice regularly. You can use online resources, such as YouTube tutorials, to learn the basics. It's important to practice consistently and to take notes on your progress. Also, consider joining a piano club or community to get support and practice with others. The key is to start with the basics and gradually build up to more complex skills.",
"The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to start with a simple piece and gradually increase the complexity. Also, it's helpful to practice with a friend or a music teacher.",
"The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to start with a simple piece and gradually increase the complexity of the music you learn.",
"The best way to learn how to play the piano is to practice regularly, especially at home. You can start with simple songs and gradually increase the complexity. It's important to practice every day, even if it's just for a few minutes. You can also use online resources, such as YouTube tutorials or music apps, to help you learn. Consistency is key, and it's important to take it step by step. Also, you can practice with a friend or a music teacher to get feedback and improve your skills.",
"The best way to learn how to play the piano is to start with a basic piano and practice regularly. You can use online resources such as YouTube tutorials or apps like \"Piano Lessons\" to help you learn. It's important to practice every day and to take it step by step. Also, you can find piano lessons online or at local music schools.",
"The best way to learn how to play the piano is to practice regularly, study music theory, and take lessons from a professional musician. It's important to start with a simple piece and gradually increase the complexity of the music you learn. Consistent practice and dedication will help you improve your skills over time.",
"The best way to learn how to play the piano is to practice regularly, listen to music, and take lessons from a professional musician. It's important to start with a simple melody and gradually build up to more complex pieces. Also, it's helpful to practice with a friend or a music teacher to get feedback and improve your skills."
]
},
"How to get a job at Google?": {
"response_count": 20,
"unique_response_count": 16,
"responses": [
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to the Google job board, and you can also get a job through Google's internal recruitment process.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's employee placement services.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also get a job through Google's own job market.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's employee placement program. You can also apply to Google's recruitment process.",
"To get a job at Google, you need to have a degree or a bachelor's degree. You can apply for internships or part-time jobs at Google. You can also get a job through Google's career center. You can also get a job through Google's job board. You can also get a job through Google's internship program. You can also get a job through Google's employee portal. You can also get a job through Google's job search. You can also get a job through Google's job board. You can also get a job through Google's career center. You can also get a job through Google's employee portal.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply for internships or full-time positions at Google. You can also look for opportunities in other companies or organizations.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's campus recruitment program. You can also apply to Google's employee placement program. You can also apply to Google's job search program. You can also apply to Google's online job platform.",
"To get a job at Google, you need to have a degree or a bachelor's degree. You can apply for internships or part-time jobs. You can also get a job through Google's career center. You can also get a job through Google's job board. You can also get a job through Google's online courses. You can also get a job through Google's online courses and other online resources. You can also get a job through Google's online courses and other online resources.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's campus programs, which are open to all students. You can also apply to Google's online job portals, which are available for all individuals. You can also apply to Google's internship programs, which are available for students and professionals. You can also apply to Google's employee placement services, which are available for all individuals.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply for internships or full-time positions at Google. You can also look for opportunities in the tech industry.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to the Google job board, and you can also get a job through Google's internal programs.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also get a job through Google's own job market.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program. You can also apply to Google's employee placement services.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job board, and you can also apply to Google's internship program. You can also apply to Google's campus recruitment program.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also apply to Google's internship program.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply to Google's job portals, and you can also get a job through Google's own job market. You can also get a job through Google's online courses and certifications.",
"To get a job at Google, you need to have a degree in computer science or a related field. You can apply for internships or full-time positions at Google. You can also look for opportunities through Google's job board or other job portals.",
"To get a job at Google, you need to have a degree or a certificate in computer science, and you can apply for a job through Google's job portal. You can also apply for internships or other positions through Google's job portal. You can also apply for internships or other positions through Google's job portal. You can also apply for internships or other positions through Google's job portal."
]
},
"Are you familiar with the Special Theory of Relativity and can you explain it to me?": {
"response_count": 13,
"unique_response_count": 9,
"responses": [
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.",
"The Special Theory of Relativity is a theory that describes the relationship between space and time. It states that the laws of physics are the same in all inertial frames of reference. This means that the speed of light is constant in a given frame of reference, and that the laws of physics are the same in all inertial frames of reference. The theory was developed by Einstein in 1905. It is a fundamental part of modern physics and is used in many areas of science and technology.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It states that time, length, and mass are all affected by the speed of an object in motion. The theory was developed by Einstein and is one of the most important theories in physics.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at speeds close to the speed of light. It is a fundamental theory of physics and is one of the most important theories in physics.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds. It states that objects in motion have a relativistic effect on their speed. The theory was developed by Albert Einstein in 1905. It is a fundamental part of modern physics and is used in many areas of science and technology.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is also known as the theory of relativity. The theory states that the speed of light is constant in a vacuum, and that objects moving at high speeds relative to each other experience time dilation. The theory also states that the laws of physics are the same in all inertial frames of reference. The theory is one of the most important theories in physics.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds. It states that objects in motion have a relativistic mass and that time and space are affected by the speed of the object. The theory was developed by Einstein and is one of the most important theories in physics.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is a fundamental part of physics and is used in many areas of science and technology.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It states that objects in motion have a relativistic mass and that time and length are affected by the speed of the object. The theory was developed by Albert Einstein in 1905.",
"Yes, I am familiar with the Special Theory of Relativity. It is a theory that describes how objects move at high speeds relative to each other. It is also known as the theory of relativity. The theory states that the speed of light in a vacuum is constant and the same for all observers, no matter their motion. This is one of the most important theories in physics."
]
},
"If you could have any superpower, what would it be?": {
"response_count": 13,
"unique_response_count": 11,
"responses": [
"I would like to have the power to see the future. I would like to see the future in a way that allows me to make a difference in the world. I would like to be able to see the future in a way that allows me to make a difference in the world.",
"I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail.",
"I would like to have the power to see the future. I would like to see the future, and I would like to see the future.",
"I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail.",
"I would like to have the power to see the future. I would like to be able to predict the future. I would like to know the future in advance. I would like to know the future in advance, and I would like to know the future in advance, and I would like to know the future in advance.",
"I would like to be able to fly. I would like to fly to the top of the world. I would like to fly to the sky.",
"I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail. I would like to know what happens in the future in detail.",
"I would like to have the power to see the future. I would like to be able to change the world.",
"I would like to have the ability to see into the future. I would like to be able to travel to other dimensions. I would like to be able to change the future.",
"I would like to be able to fly. That's a superpower I could have.",
"I would like to have the ability to see into the future. I would like to be able to travel to other planets. I would like to be able to communicate with aliens.",
"I would like to have the power to see into the future. I would like to see the future in detail. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens in the future. I would like to know what happens",
"I would like to have the power to see the future. That's what I want to do."
]
},
"Recommend some interesting books to read.": {
"response_count": 12,
"unique_response_count": 10,
"responses": [
"I recommend \"The Alchemist\" by Paulo Coelho. It is a classic book about the journey of a person who starts with a dream and works to achieve it. It is a story about the journey of a person who starts with a dream and works to achieve it. It is a story about the journey of a person who starts with a dream and works to achieve it. It is a story about the journey of a person who starts with a dream and works to achieve it.",
"I recommend \"The Alchemist\" by Paulo Coelho. It is a classic book about the journey of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it.",
"I recommend \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, \"To Kill a Mockingbird\" by Harper Lee, and \"The Grapes of Wrath\" by John Steinbeck. These books are all classics and offer a great experience.",
"I recommend \"The Alchemist\" by Paulo Coelho. It is a classic book about the journey of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it. It is a story about the life of a person who starts with a dream and works to achieve it.",
"I recommend \"The Alchemist\" by Paulo Coelho, \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, \"The Lord of the Rings\" by J.R.R. Tolkien, and \"The Hobbit\" by J.R.R. Tolkien. These books are all highly recommended for reading.",
"I recommend \"The Catcher in the Rye\" by J.D. Salinger and \"To Kill a Mockingbird\" by Harper Lee. These books are both classic fiction and have a lot of depth and meaning.",
"I recommend \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, and \"To Kill a Mockingbird\" by Harper Lee. These books are all classics and have a lot of depth and meaning.",
"I recommend \"The Alchemist\" by Paulo Coelho, \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, \"The Lord of the Rings\" by J.R.R. Tolkien, \"The Hobbit\" by J.R.R. Tolkien, \"The Grapes of Wrath\" by John Steinbeck, \"The Catcher in the Rye\" by J.D. Salinger, \"The Lord of the Rings\" by J.R.R. Tolkien, \"The Hobbit\" by J.R.R. Tolkien, \"The Grapes of Wrath\" by John",
"I recommend \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, and \"To Kill a Mockingbird\" by Harper Lee. These books are all classics and offer a great experience.",
"I recommend \"The Alchemist\" by Paulo Coelho, \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, \"The Lord of the Rings\" by J.R.R. Tolkien, and \"Pride and Prejudice\" by Jane Austen.",
"I recommend \"The Catcher in the Rye\" by J.D. Salinger and \"To Kill a Mockingbird\" by Harper Lee. These books are both classic fiction and have a lot of depth and meaning.",
"I recommend \"The Alchemist\" by Paulo Coelho, \"The Catcher in the Rye\" by J.D. Salinger, \"To Kill a Mockingbird\" by Harper Lee, \"The Lord of the Rings\" by J.R.R. Tolkien, and \"The Grapes of Wrath\" by John Steinbeck. These books are all classics and offer a great range of stories and themes."
]
},
"Tell me an interesting fact about llamas.": {
"response_count": 15,
"unique_response_count": 15,
"responses": [
"Llamas are a type of camel used for herding and grazing in the Andes region of South America. They are known for their strong and powerful bodies and their ability to run very fast.",
"Llamas are a type of camel used for herding and grazing in the Andes region of South America. They are known for their strong and powerful muscles and their ability to run long distances. They are also known for their unique social behavior and communication methods.",
"Llamas are a type of camel used for herding and transporting goods. They are known for their strong and powerful legs and their ability to run very fast. They are also known for their ability to carry heavy loads.",
"Llamas are animals that live in the Andes mountains. They are known for their wool and their ability to carry heavy loads. They are also known for their strong and powerful legs and their ability to run very fast. They are also known for their ability to climb trees and for their ability to carry heavy loads.",
"Llamas are a type of camel used for herding and grazing. They are also known as \"camel in the desert\" and \"camel in the field\". They are native to the Andes Mountains in South America.",
"Llamas are a type of camel that live in the Andes Mountains. They are known for their ability to climb trees and their strong legs. They are also known for their ability to carry heavy loads. They are a popular source of meat and are often found in the Andes region.",
"Llamas are a type of camel used for herding and grazing. They are known for their ability to walk on water and their ability to carry heavy loads.",
"Llamas are a type of camel used for herding and grazing in the Andes region of South America. They are known for their ability to walk on water and their ability to carry heavy loads. They are also known for their intelligence and adaptability. They are a symbol of the Andes culture and are often seen in Andes folklore.",
"llamas are animals that live in the Andes mountains in South America. They are known for their wool and their ability to carry heavy loads.",
"Llamas are a type of camel that are native to South America. They are known for their ability to walk on water and their ability to carry heavy loads. They are also known for their ability to climb trees and their ability to swim in water. They are also known for their ability to carry heavy loads and their ability to walk on water. They are also known for their ability to carry heavy loads and their ability to walk on water. They are also known for their ability to carry heavy loads and their ability to walk on water.",
"Lamas are a type of llama that are known for their unique behavior and are often found in the Andes region of South America. They are also known as \"the llamas of the Andes\" and are considered to be one of the most important species of llamas in the world.",
"Lamas are a type of animal that live in the Andes mountains. They are known for their beautiful appearance and are often seen in the Andes. They are also known as the \"Llama of the Andes\" and are a symbol of the Andes region. They are also known as the \"Llama of the Andes\" and are a symbol of the Andes region. They are also known as the \"Lama of the Andes\" and are a symbol of the Andes region.",
"Lamas are also called \"Llama\" in some countries. They are a type of animal that live in the Andes mountains. They are known for their unique way of life and their ability to live in different environments. They are also called \"Llama\" in some countries. They are a type of animal that live in the Andes mountains. They are known for their unique way of life and their ability to live in different environments.",
"Llamas are the most common type of animal in the Americas. They are also known as \"the camel in the clouds.\" They are used for herding and carrying goods. They are also known as \"the camel in the clouds\" and \"the camel in the clouds.\" They are also known as \"the camel in the clouds\" and \"the camel in the clouds.\" They are also known as \"the camel in the clouds\" and \"the camel in the clouds.\" They are also known as \"the camel in the clouds\" and \"the camel in the clouds.\"",
"Llamas are animals that live in the Andes mountains. They are known for their wool and their ability to run very fast. They are also known for their ability to climb trees."
]
},
"What is the best way to learn a new language?": {
"response_count": 9,
"unique_response_count": 9,
"responses": [
"The best way to learn a new language is to practice it daily, speak with native speakers, and immerse yourself in the language culture.",
"The best way to learn a new language is to practice it daily and immerse yourself in it.",
"The best way to learn a new language is to practice it daily and immerse yourself in the language as much as possible.",
"The best way to learn a new language is to practice it daily and immerse yourself in it as much as possible.",
"The best way to learn a new language is to practice it daily and to immerse yourself in it as much as possible.",
"The best way to learn a new language is to practice it daily, speak with native speakers, and immerse yourself in the language. It's important to take it slowly and to be patient with yourself.",
"The best way to learn a new language is to practice it daily, speak it, and immerse yourself in it.",
"The best way to learn a new language is to practice it daily, speak with native speakers, and immerse yourself in the language as much as possible.",
"The best way to learn a new language is to practice it daily and to speak with native speakers."
]
}
}
Owner
Thank you very much for your contribution! Could you also contribute this code to the upstream community?
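Description
This PR makes two small, safe fixes that improve numerical robustness in the CANN backend. The goal is to make llama-parallel more stable under concurrent load. Related issue: "llama-parallel 精度问题Bug修复" (llama-parallel precision bug fix).
Main changes:
- Introduce an FP32 intermediate buffer in aclnn_reduce_sum.
- Generate a gamma tensor with the same dtype as dst->type and fill it with 1.0. This behavior is more reasonable and matches the newer upstream community code.
Neither change alters the mathematical definition of the operators. They only improve numerical stability and compatibility with mixed-precision scenarios.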
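To illustrate why an FP32 intermediate buffer helps a sum reduction, here is a minimal sketch in plain Python. It is not the CANN implementation: the function names are hypothetical, and half precision is emulated with the standard `struct` module's `e` format. It only shows the general effect of accumulating in FP16 versus accumulating in FP32 and casting the final result back.

```python
import struct


def to_fp16(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x: float) -> float:
    """Round x to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def reduce_sum_fp16(values):
    """Hypothetical reduction that keeps the accumulator in FP16."""
    acc = 0.0
    for v in values:
        # Every partial sum is rounded back to FP16.
        acc = to_fp16(acc + to_fp16(v))
    return acc


def reduce_sum_fp32_buffer(values):
    """Hypothetical reduction that accumulates in FP32, then casts once."""
    acc = 0.0
    for v in values:
        # Inputs are FP16, but partial sums live in an FP32 buffer.
        acc = to_fp32(acc + to_fp16(v))
    return to_fp16(acc)


ones = [1.0] * 4096
print(reduce_sum_fp16(ones))         # 2048.0: FP16 spacing is 2 above 2048, so adding 1.0 stalls
print(reduce_sum_fp32_buffer(ones))  # 4096.0
```

The FP16 accumulator stops growing at 2048 because 2049 is not representable in half precision and rounds back to 2048. Long reductions over FP16 tensors can lose accuracy in the same way, which is the kind of error an FP32 intermediate buffer avoids.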
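Testing
Validated with the Qwen3-0.6B Q4_0 model on an Ascend910B2 by running the following script:
In the test, 8 clients ran 128 requests in parallel. The generated results were stable with no abnormal output, and the answers were logically clear and consistent in style, as expected.
Partial results are in the attached log:
(Full log from the PR discussion, submitted alongside.)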
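In addition:
- The variation among responses is caused by the --junk parameter: once the model receives a variable number of "junk" tokens, it naturally produces slightly different outputs. The same happens on the CPU and GPU backends, so this is expected behavior.
- With --junk 0, all concurrent requests produce identical, deterministic output.
This confirms that the modified behavior is correct and more robust than before the change.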
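The determinism check described above can be automated. The sketch below is one way to do it in Python, assuming the responses have already been collected as (prompt, response) pairs. The function names are hypothetical, and the field names simply mirror the `response_count` and `unique_response_count` keys in the posted log. How the author actually produced that log is not stated.

```python
from collections import defaultdict


def summarize_responses(pairs):
    """Group (prompt, response) pairs and count total and distinct responses."""
    grouped = defaultdict(list)
    for prompt, response in pairs:
        grouped[prompt].append(response)
    return {
        prompt: {
            "response_count": len(responses),
            "unique_response_count": len(set(responses)),
        }
        for prompt, responses in grouped.items()
    }


def is_deterministic(summary):
    """True when every prompt produced exactly one distinct response."""
    return all(entry["unique_response_count"] == 1 for entry in summary.values())


# Made-up sample data for illustration only, not taken from the PR log.
sample = [
    ("prompt A", "answer 1"),
    ("prompt A", "answer 1"),
    ("prompt B", "answer 2"),
    ("prompt B", "answer 3"),
]
summary = summarize_responses(sample)
print(summary["prompt A"]["unique_response_count"])  # 1
print(is_deterministic(summary))                     # False: prompt B has two distinct answers
```

With greedy sampling (--top-k 1) and --junk 0, a run that matches the description above would make `is_deterministic` return True for every prompt.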
Notes