TroyDoesAI committed on
Commit
32d8b15
1 Parent(s): a59beaf

Update README.md

Files changed (1)
  1. README.md +39 -88
README.md CHANGED
@@ -2,110 +2,59 @@
2
  license: cc-by-4.0
3
  ---
4
 
5
- # Mermaid-Llama-3-8B:
6
 
7
- ---
8
- New Prompt Template.
9
- "instruction,input,output": "Mermaid-Generator-Request: %instruction%\n\n### Input:\n%input%\n\n### Mermaid:\n%output%"
10
-
11
- ---
12
 
13
- ---
14
- license: apache-2.0
15
- ---
16
-
17
- **MermaidLLama: Unleashing the Power of Llama 3 - 8 Billion Parameters**
18
-
19
- Introducing Mermaid-LLama-3-8B, a robust language model designed for Python code understanding and crafting captivating story flow maps.
20
- With a petite 8.3 billion parameters, this model builds on the success of LLama 3's ability to maintain versatility in programming, mathematical reasoning,
21
- and general language processing.
22
  ![MermaidLlama GIF](Mermaid_ShowCase/MermaidLlama.webp)
23
 
24
-
25
- **Key Features:**
26
 
27
  1. **Code Understanding:**
28
  - Masters Python intricacies with finesse.
29
  - Generates clear and accurate Mermaid Diagram Flow Charts.
30
- - Ideal for developers seeking visual representations of their code's logic to better solve problems.
31
 
32
  2. **Storytelling Capabilities:**
33
  - Converts narrative inputs into captivating Mermaid Diagrams.
34
  - Maps character interactions, plot developments, and narrative arcs.
35
 
36
  3. **Unmatched Performance:**
37
- - Surpasses GPT-4 in generating well-organized and detailed Mermaid Diagrams, its not even a competition Mermaid's do it better.🧜‍♀️
38
 
39
  4. **Training Insights:**
40
- - Trained on a diverse dataset, including 800 unique, hand curated Mermaid Graph examples utilizing 478 complete python programs (hand tested each program) rather than code snippets.
41
- - Exhibits emergent properties in story-to-flow map translations and step by step instruction flow maps to which it was not trained for.
42
- - Adaptable and willing to be Prompt Engineered for your specific utilization.
43
-
44
- **Collaboration:**
45
 
46
- Mermaid is open to collaboration to further enhance its capabilities.
47
 
48
- If you're interested in contributing or collaborating, feel free to reach out to troydoesai@gmail.com.
49
- Your expertise could play a pivotal role in refining Mermaid.
50
 
51
- **Example Use Cases:**
52
 
53
- - **Code Documentation:**
54
- Developers can use Mermaid to automatically generate visual flow charts from their Python code, aiding in documentation and code understanding.
55
-
56
- - **Storyboarding:**
57
- Storytellers and writers can input their narrative and receive visually appealing Mermaid Diagrams, offering a structured overview of character interactions and plot progression.
58
-
59
- - **Project Planning:**
60
- Project managers can create visual project flow maps, facilitating effective communication and planning among team members.
61
-
62
- - **Learning Python:**
63
- Students and beginners can visually understand Python code structures, enhancing their learning experience.
64
-
65
- - **Game Design:**
66
- Game developers can utilize Mermaid for visualizing game storylines, ensuring a coherent narrative structure and character development.
67
-
68
- **Proof of Concept:**
69
-
70
- Mermaid proves that innovation thrives in compact packages, delivering exceptional performance across diverse applications.
71
 
72
  Stay tuned for the release of the VSCode Extension that displays the Live Flow Map every time a user stops typing for more than 10 seconds.
73
 
74
- For best results, use full precision with one of the three different instruction types:
75
-
76
- Exciting times ahead as we delve into the MermaidLLama revolution! 🚀
77
-
78
- # Training:
79
- LoRA Rank
80
- - 2048
81
 
82
- LoRA Alpha
83
- - 4096
84
 
85
- Batch Size
86
- - 1
87
-
88
- Micro Batch Size
89
- - 1
90
-
91
- Cutoff Length
92
- - 4096
93
-
94
- Save every n steps
95
- - 1000
96
-
97
- Epochs
98
- - 3
99
-
100
- Learning Rate
101
- - 1e-6
102
-
103
- LR Scheduler
104
- - cosine
105
-
106
-
107
- Target Modules
108
-
109
  - Enable q_proj
110
  - Enable v_proj
111
  - Enable k_proj
@@ -114,25 +63,27 @@ Target Modules
114
  - Enable down_proj
115
  - Enable up_proj
116
 
117
- Example of what you would send to my model to get a flow graph that ends with ``` every time.
118
-
119
- https://drive.google.com/file/d/1z1QM9KZdg_VeH-tGwIY-cyxDas0ucnKc/view?usp=sharing
120
- ---
121
 
122
  Start by downloading one of my models.
 
123
  ![0 TroyDoesAI GIF](Mermaid_ShowCase/0_TroyDoesAI.gif)
124
 
125
  Load the model.
 
126
  ![1 Load Model in 4-bit Show Example Use GIF](Mermaid_ShowCase/1_LoadModel_in_4bit_Show_Example_Use.gif)
127
 
128
- Inference the model using my prompt template replacing your instruction and input to generate a Mermaid code block,
129
- that can be viewed in the Mermaid Live Editor or Using the Mermaid CLI tool directly creating an image of the flow map.
130
- More on the cli tool here: https://github.com/mermaid-js/mermaid-cli
131
 
132
  ![2 Loaded Model in Full Precision 16-bit Show Inference and Mermaid Live Editor GIF](Mermaid_ShowCase/2_Loaded_Model_in_Full_Precision_16bit_Show_Inference_and_Mermaid_Live_editor.gif)
133
 
134
- Here We Open the additional VLLM GUI Program I wrote for the example, while still running the Mermaid-Llama-8B to compare the flow diagram to the actual program.
 
135
  ![3 Open The Program VLLM Program With Full Precision Mermaid-Llama-8B Running to Evaluate Flow Map GIF](Mermaid_ShowCase/3_Open_The_Program_VLLM_Program_With_Full_Precision_Mermaid-Llama-8B-Running_to_evaluate_flow_map.gif)
136
 
137
- # More on my cute VLLM Class and inference GUI.
 
138
  ![Python RtdBsaz8gy GIF](Mermaid_ShowCase/python_RtdBsaz8gy.gif)
 
 
 
 
2
  license: cc-by-4.0
3
  ---
4
 
5
+ # Mermaid-Llama-3-8B
6
 
7
+ Introducing Mermaid-Llama-3-8B, a robust language model designed for Python code understanding and crafting captivating story flow maps. With 8.3 billion parameters, this model excels in programming, mathematical reasoning, and general language processing.
8
 
9
  ![MermaidLlama GIF](Mermaid_ShowCase/MermaidLlama.webp)
10
 
11
+ ## Key Features
 
12
 
13
  1. **Code Understanding:**
14
  - Masters Python intricacies with finesse.
15
  - Generates clear and accurate Mermaid Diagram Flow Charts.
16
+ - Ideal for developers seeking visual representations of their code logic.
17
 
18
  2. **Storytelling Capabilities:**
19
  - Converts narrative inputs into captivating Mermaid Diagrams.
20
  - Maps character interactions, plot developments, and narrative arcs.
21
 
22
  3. **Unmatched Performance:**
23
+ - Surpasses GPT-4 in generating well-organized Mermaid Diagrams.
24
 
25
  4. **Training Insights:**
26
+ - Trained on a diverse dataset, including 800 unique, hand-curated Mermaid Graph examples utilizing 478 complete Python programs.
27
+ - Exhibits emergent properties in story-to-flow map translations and step-by-step instruction flow maps.
 
 
 
28
 
29
+ ## Collaboration
30
 
31
+ Interested in enhancing Mermaid's capabilities? Contact troydoesai@gmail.com for collaboration opportunities.
 
32
 
33
+ ## Example Use Cases
34
+ - **Retrieval-Augmented Generation (RAG):** Utilize Mermaid-Llama-3-8B to create condensed knowledge graphs. This model excels in generating flow diagrams that enhance the retrieval process. These knowledge graphs are stored in a vector database, which allows for quick and efficient retrieval of contextually relevant information. When a query is received, the system retrieves a pertinent knowledge graph and appends it as context for the model. This enriched context enables Mermaid-Llama-3-8B to deliver more accurate and nuanced responses. This approach is particularly beneficial in applications requiring deep, context-aware interactions, such as sophisticated Q&A systems, dynamic data analysis, and complex decision-making tasks.
35
+ - **Code Documentation:** Automatic visual flow charts from Python code.
36
+ - **Storyboarding:** Visually appealing diagrams for storytelling.
37
+ - **Project Planning:** Visual project flow maps for effective team communication.
38
+ - **Learning Python:** Helps students visually understand Python code structures.
39
+ - **Game Design:** Visualizing game storylines for coherent narrative structure.
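
The retrieval flow described in the RAG use case can be sketched roughly as follows. This is only an illustration under stated assumptions: `GraphStore` is a hypothetical in-memory stand-in for a vector database, the bag-of-words `tokenize` function stands in for a real embedding model, and the `Context graph:` / `Question:` layout is invented for the example rather than taken from the model's prompt format.

```python
import math
import re
from collections import Counter


def tokenize(text: str) -> Counter:
    """Lowercase bag-of-words counts; a toy stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9_]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class GraphStore:
    """Hypothetical in-memory stand-in for a vector database of Mermaid graphs."""

    def __init__(self) -> None:
        self._entries: list[tuple[Counter, str]] = []

    def add(self, mermaid_graph: str) -> None:
        # Store the graph text next to its (toy) vector representation.
        self._entries.append((tokenize(mermaid_graph), mermaid_graph))

    def retrieve(self, query: str) -> str:
        """Return the stored graph most similar to the query."""
        q = tokenize(query)
        return max(self._entries, key=lambda entry: cosine(q, entry[0]))[1]


def build_context(store: GraphStore, query: str) -> str:
    """Prepend the retrieved graph to the query (layout is an assumption)."""
    return f"Context graph:\n{store.retrieve(query)}\n\nQuestion: {query}"


store = GraphStore()
store.add("graph TD\n  login[User login] --> check{Password valid?}\n  check -->|yes| home[Home page]")
store.add("graph TD\n  order[Place order] --> pay[Payment] --> ship[Shipping]")
context = build_context(store, "What happens after the password check on login?")
```

In a real setup the toy similarity would be replaced by embeddings and a proper vector index; the sketch only shows the store, retrieve, and append-as-context steps.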
40
 
41
+ ## Proof of Concept
42
 
43
  Stay tuned for the release of the VSCode Extension that displays the Live Flow Map every time a user stops typing for more than 10 seconds.
44
 
45
+ ## Training Specifications
46
 
47
+ - **LoRA Rank:** 2048
48
+ - **LoRA Alpha:** 4096
49
+ - **Batch Size:** 1
50
+ - **Micro Batch Size:** 1
51
+ - **Cutoff Length:** 4096
52
+ - **Save every n steps:** 1000
53
+ - **Epochs:** 3
54
+ - **Learning Rate:** 1e-6
55
+ - **LR Scheduler:** Cosine
56
 
57
+ **Target Modules:**
58
  - Enable q_proj
59
  - Enable v_proj
60
  - Enable k_proj
 
63
  - Enable down_proj
64
  - Enable up_proj
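
For reference, the listed hyperparameters can be collected into a plain dictionary, from which two derived quantities follow. This is a sketch, not the author's training script: the key names are made up for illustration, the scaling factor uses the standard LoRA convention of `alpha / rank`, and the gradient accumulation formula (`batch_size // micro_batch_size`) is an assumption about how the trainer interprets those two fields.

```python
# Hyperparameters as listed in the README; key names are illustrative only.
training_config = {
    "lora_rank": 2048,
    "lora_alpha": 4096,
    "batch_size": 1,
    "micro_batch_size": 1,
    "cutoff_length": 4096,
    "save_every_n_steps": 1000,
    "epochs": 3,
    "learning_rate": 1e-6,
    "lr_scheduler": "cosine",
    # Only the modules visible in this diff; the hunk boundary hides two lines,
    # so this list is not claimed to be complete.
    "target_modules": ["q_proj", "v_proj", "k_proj", "down_proj", "up_proj"],
}


def lora_scaling(cfg: dict) -> float:
    """Standard LoRA scaling factor applied to the low-rank update: alpha / rank."""
    return cfg["lora_alpha"] / cfg["lora_rank"]


def gradient_accumulation_steps(cfg: dict) -> int:
    """Assumed convention: accumulate micro-batches until the batch size is reached."""
    return cfg["batch_size"] // cfg["micro_batch_size"]


scaling = lora_scaling(training_config)
accumulation = gradient_accumulation_steps(training_config)
```

With the values above, the low-rank update is scaled by 2.0 and each optimizer step uses a single micro-batch.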
65
 
66
+ ## Getting Started
 
 
 
67
 
68
  Start by downloading one of my models.
69
+
70
  ![0 TroyDoesAI GIF](Mermaid_ShowCase/0_TroyDoesAI.gif)
71
 
72
  Load the model.
73
+
74
  ![1 Load Model in 4-bit Show Example Use GIF](Mermaid_ShowCase/1_LoadModel_in_4bit_Show_Example_Use.gif)
75
 
76
+ Use my prompt template to generate a Mermaid code block, which can be viewed in the Mermaid Live Editor or using the Mermaid CLI tool.
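
As a rough sketch of that step, the helpers below fill in the prompt template shown in the previous README revision (`Mermaid-Generator-Request: %instruction%` followed by `### Input:` and `### Mermaid:` sections) and trim the completion at its closing code fence. It assumes that template still applies after this commit; `build_prompt` and `extract_mermaid` are hypothetical helper names, and the model call itself is left out.

```python
FENCE = "`" * 3  # Markdown code fence, built this way to keep the example readable

# Template from the earlier README revision, with the %output% slot left empty
# so the model completes it.
PROMPT_TEMPLATE = (
    "Mermaid-Generator-Request: {instruction}\n\n"
    "### Input:\n{input}\n\n"
    "### Mermaid:\n"
)


def build_prompt(instruction: str, input_text: str) -> str:
    """Fill the instruction and input slots of the prompt template."""
    return PROMPT_TEMPLATE.format(instruction=instruction, input=input_text)


def extract_mermaid(completion: str) -> str:
    """Return the Mermaid source from a completion, dropping the code fences."""
    text = completion.strip()
    if text.startswith(FENCE):
        # Drop the opening fence line (which may carry a language tag).
        text = text.split("\n", 1)[1] if "\n" in text else ""
    end = text.find(FENCE)
    if end != -1:
        # Keep only what precedes the closing fence.
        text = text[:end]
    return text.strip()


prompt = build_prompt(
    "Create the mermaid diagram for the following code:",
    "def add(a, b):\n    return a + b",
)
```

The extracted text can be pasted into the Mermaid Live Editor, or saved to a file such as `flow.mmd` and rendered with the Mermaid CLI, for example `mmdc -i flow.mmd -o flow.svg`.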
 
 
77
 
78
  ![2 Loaded Model in Full Precision 16-bit Show Inference and Mermaid Live Editor GIF](Mermaid_ShowCase/2_Loaded_Model_in_Full_Precision_16bit_Show_Inference_and_Mermaid_Live_editor.gif)
79
 
80
+ Here we open the VLLM GUI program while Mermaid-Llama-8B is still running in VRAM, to compare the flow diagram to the actual program and to show the lightweight capabilities of small models on consumer hardware.
81
+
82
  ![3 Open The Program VLLM Program With Full Precision Mermaid-Llama-8B Running to Evaluate Flow Map GIF](Mermaid_ShowCase/3_Open_The_Program_VLLM_Program_With_Full_Precision_Mermaid-Llama-8B-Running_to_evaluate_flow_map.gif)
83
 
84
+ ## More on my VLLM Class and inference GUI
85
+
86
  ![Python RtdBsaz8gy GIF](Mermaid_ShowCase/python_RtdBsaz8gy.gif)
87
+ ---
88
+
89
+ Note: Treat this model as an auto-complete model. Do not try talking to it in chat; you will get garbage, because those layers have been pruned and replaced. That is all you will hear of my secret sauce for training on small (< 1000 entry) datasets.