daekeun-ml/Phi-3-medium-4k-instruct-ko-poc-v0.1

@@ -20,7 +20,7 @@ pipeline_tag: text-generation
 ## Model Details
 This model was trained with the unsloth toolkit on top of Microsoft's Phi-3 model, with some Korean instruction data added to improve its Korean generation performance.
-Since my role is not as a working developer, but as ML Technical Specialist helping customers with quick PoCs/prototypes, and I was limited by Azure GPU resources available, I only trained with 40,000 samples on a single A100 GPU () for PoC purposes. Because I have not done any tokenizer extensions, you need a lot more tokens than English for text generation.
 ### Dataset
@@ -32,6 +32,8 @@ The dataset used for training is as follows. To prevent catastrophic forgetting,
 ## How to Get Started with the Model
 ```python
 ### Load model
 import torch
@@ -67,6 +69,7 @@ params = {
 ### Inference
 FastLanguageModel.for_inference(model) # Enable native 2x faster inference
 messages = [
     {"from": "human", "value": "Continue the fibonnaci sequence in Korean: 1, 1, 2, 3, 5, 8,"},
     {"from": "assistant", "value": "피보나치 수열의 다음 숫자는 13, 21, 34, 55, 89 등입니다. 각 숫자는 앞의 두 숫자의 합입니다."},
@@ -82,6 +85,7 @@ inputs = tokenizer.apply_chat_template(
 text_streamer = TextStreamer(tokenizer)
 _ = model.generate(input_ids = inputs, streamer = text_streamer, **params)
 messages = [
     {"from": "human", "value": "What is Machine Learning in Korean?"},
     {"from": "assistant", "value": "인공지능의 한 분야로 방대한 데이터를 분석해 향후 패턴을 예측하는 기법입니다."},
@@ -99,6 +103,29 @@ text_streamer = TextStreamer(tokenizer)
 _ = model.generate(input_ids = inputs, streamer = text_streamer, **params)
 ```
 ### References
 - Base model: [microsoft/Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct)

 ## Model Details
 This model was trained with the unsloth toolkit on top of Microsoft's Phi-3 model, with some Korean instruction data added to improve its Korean generation performance.
+My role is not that of a working developer but of an ML Technical Specialist who helps customers with quick PoCs/prototypes, and I was limited by the Azure GPU resources available, so I trained on only 40,000 samples on a single Azure Standard_NC24ads_A100_v4 VM for PoC purposes. Because I have not extended the tokenizer, generating Korean text requires many more tokens than generating English text.
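
The tokenizer note above can be made concrete with a rough measurement of tokens per character. The sketch below is only an illustration: `tokens_per_char` and `utf8_byte_encode` are hypothetical helpers, and the byte-level encoder is a stand-in for what happens when a vocabulary has no dedicated Korean tokens and falls back to bytes. It is not the Phi-3 tokenizer, so the ratios it prints are not measurements of this model.

```python
from typing import Callable, List, Sequence


def tokens_per_char(encode: Callable[[str], Sequence[int]], text: str) -> float:
    """Average number of tokens the encoder emits per character of text."""
    if not text:
        raise ValueError("text must be non-empty")
    return len(encode(text)) / len(text)


def utf8_byte_encode(text: str) -> List[int]:
    """Toy stand-in encoder (assumption): one token per UTF-8 byte."""
    return list(text.encode("utf-8"))


english = "machine learning"
korean = "기계 학습"

en_ratio = tokens_per_char(utf8_byte_encode, english)
ko_ratio = tokens_per_char(utf8_byte_encode, korean)
print(f"English: {en_ratio:.2f} tokens/char, Korean: {ko_ratio:.2f} tokens/char")
# prints "English: 1.00 tokens/char, Korean: 2.60 tokens/char"
```

To measure the real overhead, pass the model's own tokenizer instead, for example `lambda s: tokenizer(s)["input_ids"]` with the `tokenizer` object loaded in the snippet under "How to Get Started with the Model".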
 ### Dataset
 ## How to Get Started with the Model
+### Code snippets
 ```python
 ### Load model
 import torch
 ### Inference
 FastLanguageModel.for_inference(model) # Enable native 2x faster inference
+# 1st example
 messages = [
     {"from": "human", "value": "Continue the fibonnaci sequence in Korean: 1, 1, 2, 3, 5, 8,"},
     {"from": "assistant", "value": "피보나치 수열의 다음 숫자는 13, 21, 34, 55, 89 등입니다. 각 숫자는 앞의 두 숫자의 합입니다."},
 text_streamer = TextStreamer(tokenizer)
 _ = model.generate(input_ids = inputs, streamer = text_streamer, **params)
+# 2nd example
 messages = [
     {"from": "human", "value": "What is Machine Learning in Korean?"},
     {"from": "assistant", "value": "인공지능의 한 분야로 방대한 데이터를 분석해 향후 패턴을 예측하는 기법입니다."},
 _ = model.generate(input_ids = inputs, streamer = text_streamer, **params)
 ```
+### Inference results
+```
+# 1st example
+<s><|user|> Continue the fibonnaci sequence in Korean: 1, 1, 2, 3, 5, 8,<|end|><|assistant|> 피보나치 수열의 다음 숫자는 13, 21, 34, 55, 89 등입니다. 각 숫자는 앞의 두 숫자의 합입니다.<|end|><|user|> Compute 2x+3=12 in Korean<|end|><|assistant|> 방정식 2x + 3 = 12에서 x를 풀려면 다음 단계를 따르십시오.
+1. 방정식의 양쪽에서 3을 빼서 방정식의 한쪽에 대해 x를 분리합니다.
+2x + 3 - 3 = 12 - 3
+2x = 9
+2. 이제 방정식의 양쪽을 2로 나누어 x의 값을 구합니다.
+2배 / 2 = 9 / 2
+x = 4.5
+따라서 방정식 2x + 3 = 12에 대한 해는 x = 4.5입니다.<|end|>
+# 2nd example
+<s><|user|> What is Machine Learning in Korean?<|end|><|assistant|> 인공지능의 한 분야로 방대한 데이터를 분석해 향후 패턴을 예측하는 기법입니다.<|end|><|user|> What is Deep Learning in Korean?<|end|><|assistant|> 복잡한 데이터 세트를 분석하고 복잡한 패턴을 인식하고 학습하는 데 사용되는 딥러닝은 많은 레이어로 구성된 신경망의 하위 집합입니다. 이 기술은 이미지 인식, 자연어 처리 및 자율 운전과 같은 다양한 응용 분야에서 큰 발전을 이뤘습니다.<|end|>
+```
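
The inference results show how the `{"from": ..., "value": ...}` messages end up as a single tagged string. A minimal sketch of that mapping follows, assuming the decoded layout printed above (`<s>`, `<|user|>`, `<|assistant|>`, `<|end|>`, with a space after each role tag). `render_sharegpt` and `ROLE_TAGS` are hypothetical names, and the template that `tokenizer.apply_chat_template` actually applies may differ in whitespace or newlines, so treat this as a reading aid rather than a replacement for the tokenizer call.

```python
from typing import Dict, List

# Role names used in the messages above, mapped to the tags seen in the decoded output.
ROLE_TAGS = {"human": "<|user|>", "assistant": "<|assistant|>"}


def render_sharegpt(messages: List[Dict[str, str]], add_generation_prompt: bool = True) -> str:
    """Approximate the decoded prompt string for ShareGPT-style messages."""
    parts = ["<s>"]
    for message in messages:
        tag = ROLE_TAGS[message["from"]]
        parts.append(f"{tag} {message['value']}<|end|>")
    if add_generation_prompt:
        # Leave the assistant tag open so the model writes the next turn.
        parts.append("<|assistant|>")
    return "".join(parts)


messages = [
    {"from": "human", "value": "What is Machine Learning in Korean?"},
    {"from": "assistant", "value": "인공지능의 한 분야로 방대한 데이터를 분석해 향후 패턴을 예측하는 기법입니다."},
    {"from": "human", "value": "What is Deep Learning in Korean?"},
]

prompt = render_sharegpt(messages)
print(prompt)
```

The printed string should match the 2nd example up to the final `<|assistant|>` tag, which is where the generated Korean answer begins.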
 ### References
 - Base model: [microsoft/Phi-3-medium-4k-instruct](https://huggingface.co/microsoft/Phi-3-medium-4k-instruct)