Coder Social home page Coder Social logo

wangxin1198 / pangu-alpha-tf Goto Github PK

View Code? Open in Web Editor NEW

This project forked from deepdialog/pangu-alpha-tf

0.0 0.0 0.0 1.5 MB

pangu-alpha大模型

License: Apache License 2.0

Shell 1.12% C++ 3.55% Python 74.68% TeX 0.12% Cuda 0.22% Makefile 0.05% Jupyter Notebook 20.19% Dockerfile 0.06%

pangu-alpha-tf's Introduction

PanGu TensorFlow Version

盘古项目Repo

盘古GPU版本Repo

本Repo参考与修改自以上两个Repo中的内容,本Repo内容保持原Repo所写的Apache协议

2.6B的 float16 版本,在A6000上大概占用9GB显存

13B的 float16 版本,在A6000上大概占用34GB显存

注意float16版本尽量不要在CPU上运行,特别慢

float32版本,没有在显卡上测试过,建议2.6B在32GB内存的CPU上运行,13B在超过96GB内存的CPU上运行

本项目的姊妹篇,智源CPM-TF版本地址,这两个模型是有一定区别的,具体可以参考上面PanGu官方Repo链接里面的论文

Demo

具体请查看目录下面这几个ipynb:

可运行的Colab,2.6B fp16:https://colab.research.google.com/drive/12VYofmlZCnJqd2cW-dCnNci9edXuYIlG?usp=sharing

Download

注:百度里面没有13B的fp32,因为太大了传不上去

百度:

链接: https://pan.baidu.com/s/1PQp6bU7StZ84o9fCGsks9w 提取码: 1pbd

GDrive:

https://drive.google.com/drive/folders/1332wY_01r67u9BASS3RghTB3T8xIrTl3?usp=sharing

pangu-alpha-tf's People

Contributors

qhduan avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.