CUDA已经是11.4以上了,安装flash-attention库的时候报错。
 × python setup.py egg_info did not run successfully.
  │ exit code: 1
  ╰─> [11 lines of output]
      Traceback (most recent call last):
        File "", line 2, in 
        File "", line 34, in 
        File "/mnt/20230808/Qwen-7B-main/flash-attention/setup.py", line 113, in 
          raise RuntimeError("FlashAttention is only supported on CUDA 11 and above")
      RuntimeError: FlashAttention is only supported on CUDA 11 and above
  torch.__version__  = 2.0.1+cu117
  [end of output]
ModelScope旨在打造下一代开源的模型即服务共享平台,为泛AI开发者提供灵活、易用、低成本的一站式模型服务产品,让模型应用更简单!欢迎加入技术交流群:微信公众号:魔搭ModelScope社区,钉钉群号:44837352