全部标签 



写点什么

登录注册

关于未来 LLM AI 应用的思考，以及从 0 到 1 实现 LLM 资料分析。

作者：马说IT

2023-06-17
中国香港
本文字数：1798 字
阅读完需：约 6 分钟

关于未来LLM AI应用的思考，以及从0到1实现LLM资料分析。

以下内容由作者在从 0 到 1 学习大家 LLM（大语言数据模型）的过程当中收集而来，仅供参考和传播学习，如有涉及版权问题，请联系作者删除。

当前企业级基于 OpenAI 应用实施担心的问题

调用 chatgpt openAI 的接口贵
网络传输慢
信息安全泄露风险

畅想不久的将来基于 LLM 的 AI 应用的样子

首先，我们一定有一种新的职业，他（他）们叫 Prompt 提示工程师，在创作性行业尤其更是如此，并且已经存在我们的身边
未来的 LLM AI 应用，应该具备和 Prompt 交互的能力，并自我不断成长，给用户提供更好的服务和能力
未来的 LLM AI 应用，他应该具备动态学习的能力，不断从身边环境学习新的概念和知识，这方面，基于 Langchain 的 AutoGPT 已经初露拳脚
未来的 LLM AI 应用，它会在自己的专业领域拥有自己独特的数据模型，这也是产品的核心竞争力之一。而要达到这样的目标，那么训练自己的大语言模型就是趋势了！

关于软件行业 Prompt 能力的探索

语言

利用 ChatGPT 开发案例

开源项目 AiWriter

https://github.com/chatgpt-community/aiwriter (by sihan&marvin)

如何训练自己的 LLM 模型 - 资料分享

🧐 了解基本概念和整体思路

🧐 学习 llama 基础

🧐 开源 LLM 模型

https://huggingface.co/medalpaca https://modelscope.cn/models https://github.com/CVI-SZU/Linly https://huggingface.co/decapoda-research/llama-7b-hf https://github.com/ymcui/Chinese-LLaMA-Alpaca https://bair.berkeley.edu/blog/2023/04/03/koala/ https://github.com/OpenBuddy/OpenBuddy/blob/main/README.zh.md https://github.com/brightmart/nlp_chinese_corpus#5翻译语料translation2019zh https://lmsys.org/blog/2023-03-30-vicuna/ https://github.com/google-research/bert

🧐 开源数据集分享

https://github.com/tatsu-lab/stanford_alpaca/blob/main/alpaca_data.json https://huggingface.co/datasets https://zhuanlan.zhihu.com/p/355117358 https://github.com/tatsu-lab/stanford_alpaca https://github.com/ymcui/Chinese-LLaMA-Alpaca/tree/main/data https://github.com/dbiir/UER-py/wiki/Pretraining-data

🧐 LLM model fine-tuneing 模型微调

https://github.com/22-hours/cabrita https://github.com/Tencent/TencentPretrain/blob/main/README_ZH.md https://github.com/dbiir/UER-py/blob/master/README_ZH.md https://github.com/tloen/alpaca-lora

🧐 模型本地化部署和运行

https://orcaterm.cloud.tencent.com/terminal?instanceId=ins-ca0j5xvm&region=ap-guangzhou&regionId=1&source=cvm&from=cvm_console_login_btn https://github.com/ggerganov/llama.cpp https://github.com/antimatter15/alpaca.cpp https://github.com/cocktailpeanut/dalai https://chat.lmsys.org/?model=koala-13b https://github.com/CLUEbenchmark/pCLUE https://huggingface.co/spaces/tloen/alpaca-lora https://console.cloud.tencent.com/message/detail/1286047964

🧐 模型评估

https://wqw547243068.github.io/chatgpt_mimic

总结

学习 LLM 大语言模型 AI 应用生态是一件非常有意思和挑战性的事情，有人说这是新的风口，其实这股风已经刮起了很久，只是缺少了你的参与。

让我们保持开放包容，客观严谨的态度去学习，创作，并应用

如果你有兴趣持续性学习和研究，欢迎私信我，我们一同交流分享学习！

发布于: 刚刚阅读数: 6

马说IT

关注

软件开发、敏捷教练、AI编程、公众演讲 2019-07-21 加入

10年软件开发老炮，沟通演讲和敏捷培训深度玩家，AI入门选手

评论

发布

暂无评论