Coder Social home page Coder Social logo

Unable to replicate RPi0 tests about xnnpack HOT 3 OPEN

samveen avatar samveen commented on June 5, 2024
Unable to replicate RPi0 tests

from xnnpack.

Comments (3)

samveen avatar samveen commented on June 5, 2024

Digging more into the neon dot error shows me that I'm hitting the issue as listed in https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101723 (via #1465 ). As of now, the fix has not made it downstream into Debian bullseye unfortuantely, so successful local builds on Debian Bullseye with the neon-dot kernels is a pipe dream.

Other than that, the other issues still stand.

from xnnpack.

Maratyszcza avatar Maratyszcza commented on June 5, 2024

Try an older revision of XNNPack. We don't regularly test on pre-NEON ARM systems, and probably a recent refactoring broke something.

from xnnpack.

samveen avatar samveen commented on June 5, 2024

@Maratyszcza , I tried that too. The version which last updated the raspberry pi benchmarks table (your commit 3c6d6b4 from Oct 16, 2021). The build failed with the error about unsupported neon instructions, which lead me to the GCC bug report.

I'm in the process of upgrading to Debian Bookwork/Sid to see if I can manage to get the build working, before proceeding further. Upgrade done.

Debian Bookworm/testing is armel, not arm hf, and gcc-12/g++12 generated armel (no VFP) code. So I discarded this and went with Raspberry Pi OS Bookworm/testing which is armhf (armv6+VFP).

Gcc 10.4 and 12.2 are available.

  • Building 3c6d6b4 fails with both 10.4 and 12.2 as below:
[ 45%] Building C object CMakeFiles/all_microkernels.dir/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c.o
/usr/bin/cc -DFXDIV_USE_INLINE_ASSEMBLY=0 -DPTHREADPOOL_NO_DEPRECATED_API=1 -DXNN_ENABLE_ASSEMBLY=1 -DXNN_ENABLE_MEMOPT=1 -DXNN_ENABLE_SPARSE=1 -I/home/samveen/XNNPACK/include -I/home/samveen/XNNPACK/src -I/home/samveen/XNNPACK/build/local/pthreadpool-source/include -I/home/samveen/XNNPACK/build/local/FXdiv-source/include -I/home/samveen/XNNPACK/build/local/FP16-source/include -D__ARM_FP16_FORMAT_IEEE=1 -O3 -DNDEBUG -fPIC -Wno-psabi -pthread  -marm  -march=armv7-a -mfpu=neon-fp16  -O2  -MD -MT CMakeFiles/all_microkernels.dir/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c.o -MF CMakeFiles/all_microkernels.dir/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c.o.d -o CMakeFiles/all_microkernels.dir/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c.o -c /home/samveen/XNNPACK/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c
/home/samveen/XNNPACK/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c: In function ‘xnn_f16_f32_vcvt_ukernel__neonfp16_x8’:
/home/samveen/XNNPACK/src/f16-f32-vcvt/gen/vcvt-neonfp16-x8.c:31:11: error: unknown type name ‘float16x8_t’
   31 |     const float16x8_t vh = vreinterpretq_f16_u16(vld1q_u16(i)); i += 8;
      |           ^~~~~~~~~~~

from xnnpack.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.