Skip to content

Commit

Permalink
Deployed 8f657c7 with MkDocs version: 1.4.3
Browse files Browse the repository at this point in the history
  • Loading branch information
Unknown committed Dec 6, 2023
1 parent a7f47b9 commit 7313c11
Show file tree
Hide file tree
Showing 3 changed files with 26 additions and 1 deletion.
25 changes: 25 additions & 0 deletions model_zoo/index.html
Original file line number Diff line number Diff line change
Expand Up @@ -602,6 +602,7 @@ <h1 id="public-model-zoo">Public Model Zoo<a class="headerlink" href="#public-mo
<th>Inference APIs Available</th>
<th>Fine-tuning APIs Available</th>
<th>Inference Frameworks Available</th>
<th>Inference max total tokens (prompt + response)</th>
</tr>
</thead>
<tbody>
Expand All @@ -610,144 +611,168 @@ <h1 id="public-model-zoo">Public Model Zoo<a class="headerlink" href="#public-mo
<td></td>
<td></td>
<td>deepspeed, text-generation-inference</td>
<td>2048</td>
</tr>
<tr>
<td><code>llama-2-7b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>4096</td>
</tr>
<tr>
<td><code>llama-2-7b-chat</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>4096</td>
</tr>
<tr>
<td><code>llama-2-13b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>4096</td>
</tr>
<tr>
<td><code>llama-2-13b-chat</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>4096</td>
</tr>
<tr>
<td><code>llama-2-70b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>4096</td>
</tr>
<tr>
<td><code>llama-2-70b-chat</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>4096</td>
</tr>
<tr>
<td><code>falcon-7b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>2048</td>
</tr>
<tr>
<td><code>falcon-7b-instruct</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>2048</td>
</tr>
<tr>
<td><code>falcon-40b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>2048</td>
</tr>
<tr>
<td><code>falcon-40b-instruct</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>2048</td>
</tr>
<tr>
<td><code>mpt-7b</code></td>
<td></td>
<td></td>
<td>deepspeed, text-generation-inference, vllm</td>
<td>2048</td>
</tr>
<tr>
<td><code>mpt-7b-instruct</code></td>
<td></td>
<td></td>
<td>deepspeed, text-generation-inference, vllm</td>
<td>2048</td>
</tr>
<tr>
<td><code>flan-t5-xxl</code></td>
<td></td>
<td></td>
<td>deepspeed, text-generation-inference</td>
<td>2048</td>
</tr>
<tr>
<td><code>mistral-7b</code></td>
<td></td>
<td></td>
<td>vllm</td>
<td>8000</td>
</tr>
<tr>
<td><code>mistral-7b-instruct</code></td>
<td></td>
<td></td>
<td>vllm</td>
<td>8000</td>
</tr>
<tr>
<td><code>codellama-7b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>16384</td>
</tr>
<tr>
<td><code>codellama-7b-instruct</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>16384</td>
</tr>
<tr>
<td><code>codellama-13b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>16384</td>
</tr>
<tr>
<td><code>codellama-13b-instruct</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>16384</td>
</tr>
<tr>
<td><code>codellama-34b</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>16384</td>
</tr>
<tr>
<td><code>codellama-34b-instruct</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>16384</td>
</tr>
<tr>
<td><code>zephyr-7b-alpha</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>32768</td>
</tr>
<tr>
<td><code>zephyr-7b-beta</code></td>
<td></td>
<td></td>
<td>text-generation-inference, vllm</td>
<td>32768</td>
</tr>
</tbody>
</table>
Expand Down
2 changes: 1 addition & 1 deletion search/search_index.json

Large diffs are not rendered by default.

Binary file modified sitemap.xml.gz
Binary file not shown.

0 comments on commit 7313c11

Please sign in to comment.