From c1d218102bee1615c3332eb079ed6e6f3d2fcb80 Mon Sep 17 00:00:00 2001
From: Yu Shi Jie
Date: Mon, 23 Dec 2024 12:05:43 -0500
Subject: [PATCH] Falcon3: modified link

---
 README.md                           | 2 +-
 tutorials/download_model_weights.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 1d5e610838..c58a586fdc 100644
--- a/README.md
+++ b/README.md
@@ -117,7 +117,7 @@ Every model is written from scratch to maximize performance and remove layers of
 | CodeGemma | 7B | Google | [Google Team, Google Deepmind](https://ai.google.dev/gemma/docs/codegemma) |
 | Code Llama | 7B, 13B, 34B, 70B | Meta AI | [Rozière et al. 2023](https://arxiv.org/abs/2308.12950) |
 | Falcon | 7B, 40B, 180B | TII UAE | [TII 2023](https://falconllm.tii.ae) |
-| Falcon 3 | 1B, 3B, 7B, 10B | TII UAE | [TII 2024](https://falconllm.tii.ae/falcon3/index.html) |
+| Falcon 3 | 1B, 3B, 7B, 10B | TII UAE | [TII 2024](https://huggingface.co/blog/falcon3) |
 | FreeWilly2 (Stable Beluga 2) | 70B | Stability AI | [Stability AI 2023](https://stability.ai/blog/stable-beluga-large-instruction-fine-tuned-models) |
 | Function Calling Llama 2 | 7B | Trelis | [Trelis et al. 2023](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v2) |
 | Gemma | 2B, 7B | Google | [Google Team, Google Deepmind](https://storage.googleapis.com/deepmind-media/gemma/gemma-report.pdf) |
diff --git a/tutorials/download_model_weights.md b/tutorials/download_model_weights.md
index 5d945e4f0e..a170506c3d 100644
--- a/tutorials/download_model_weights.md
+++ b/tutorials/download_model_weights.md
@@ -12,7 +12,7 @@ LitGPT supports a variety of LLM architectures with publicly available weights.
 | Danube2 | 1.8B | H2O.ai | [H2O.ai](https://h2o.ai/platform/danube-1-8b/) |
 | Dolly | 3B, 7B, 12B | Databricks | [Conover et al. 2023](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm) |
 | Falcon | 7B, 40B, 180B | TII UAE | [TII 2023](https://falconllm.tii.ae) |
-| Falcon 3 | 1B, 3B, 7B, 10B | TII UAE | [TII 2024](https://falconllm.tii.ae/falcon3/index.html) |
+| Falcon 3 | 1B, 3B, 7B, 10B | TII UAE | [TII 2024](https://huggingface.co/blog/falcon3) |
 | FreeWilly2 (Stable Beluga 2) | 70B | Stability AI | [Stability AI 2023](https://stability.ai/blog/stable-beluga-large-instruction-fine-tuned-models) |
 | Function Calling Llama 2 | 7B | Trelis | [Trelis et al. 2023](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v2) |
 | Gemma | 2B, 7B | Google | [Google Team, Google Deepmind](https://storage.googleapis.com/deepmind-media/gemma/gemma-report.pdf) |