annetgpgpu / annetgpgpu Goto Github PK

A GPU (CUDA) based Artificial Neural Network library

License: GNU General Public License v3.0

Shell 0.01% C++ 84.23% C 2.39% Cuda 2.72% CMake 6.42% Python 0.19% SWIG 4.05%

self-organizing-map cuda c-plus-plus-11 propagation-network

annetgpgpu's Introduction

Updates - 05/10/2017: Added a new example

The program "image_generator" is located in the "/src/examples" subdirectory and was submitted by Ben Bogart. It produces abstract looking images from ordinary input images. To compile it, make sure to install OpenCV. If you want to give it a try, it takes two command line arguments and can be used like this:

./image_generator <input> <output>

Here is an example input image with the related output image. CUDA image generator example. The picture illustrates a some houses in vienna. CUDA image generator example. This image was calculated on a GTX 1080. It took apprx. 20 min and 500 MB VRAM.

Updates - 09/03/2016

Improvements of the implementation of the base classes
Added a Qt4 demo, illustrating how to implement a GUI on the example of a back-propagation network
- Note that the networks can be highly asymmetrical

GUI-example: Designer for back propagation networks. The layout of the underlying library is 1:1 represented as a QSceneGraph. After definition of the network topology, the in- and output can be defined by the user and the network trained accordingly. At the end, the error of each test-training cycle is plotted, which gives a handy representation of the network performance.

Updates - 07/31/2016

The project was rewritten into a template library
The CUDA device function pointers are poorly implemented, thus the function extensions work with template parameters now
As most of the code gets created just at compile time, the compile time increased in comparison to the old version
All classes but SOMNetGPU are now header only
A bug with SOMs on GPU was fixed, which led to wrong results of the radius decay function

Here is an example of the new device pointer replacement. The distance function of the network is now a template argument, which will simplify library functionality extensions.

int main(int argc, char *argv[]) {
	// ..
	ANNGPGPU::SOMNetGPU<float, ANN::functor_gaussian<float>> gpu;

Updates - 04/12/2016

The support for CUDA > version 4 was broken. Now it works again (tested with CUDA 7.5)

Projects

Artwork from Ben Bogart

This video is from ekran.org and shows the decomposition of a high resolution panorama by a SOM on the GPU

Linewise growing neighborhood from ekran.org

Introduction

ANNet is a small library to create neural nets. A hallmark of the project are implementations of several neural network models with usage of OpenMP and/or Thrust. See quickstart guide to learn how to use them. Especially self organizing maps (SOMs) benefit strongly from calculations on GPUs and speed-ups by a factor of 100 can easily be achieved for bigger networks (>256x256 nodes). The GPU implementation of SOMs is also supporting asynchronous calculation on more than one device.

Features

Implementation:
- Self organizing maps using CUDA
- Back propagation networks
Python interface for all classes
Multi core support using OpenMP
Plugin system based on template parameters
With the exception of the CUDA implementation, this project is a header only library

Build

To build the library with all features you need:

Qt5 (for some examples required)
SWIG for python bindings (just required for python bindings)
CUDA/Thrust (shipped with CUDA; just required for GPGPU implementation)
Doxygen (required for documentation generation)
OpenMP (required if multi CPU support is wished)
Lib bzip2 (required)
CMake (required if you want to use the CMake scripts)
A C++ compiler (GCC or MinGW; required)
How you build the library:

Clone the repository with git:

git clone https://github.com/ANNetGPGPU/ANNetGPGPU.git

Create a build directory, where your compiler stores the objects:

cd ANNetGPGPU
mkdir build
cd build

Run CMake and make to build. Dependent on the installed libraries, either all or just some example programs will be built:

cmake .. && make

cmake -DCMAKE_CXX_FLAGS="-O2" -DCUDA_NVCC_FLAGS="-O2" ..
make -j8

Usage (Python interface)

There is a python interface for the library, which may have access to the GPU too. The python demo here shows a k-means like clustering approach with the SOM implementation. If the number of nodes is reduced you can calculate the "centroids" of given input vectors. In this example a three-dimensional input was chosen for simplicity. Note: Currently, the template library could work with double precision. However, the python interface is currently implemented for float.

from ANPyNetCPU import *
black 	= vectorf([0,0,0])
white 	= vectorf([1,1,1])
red 	= vectorf([1,0,0])
green 	= vectorf([0,1,0])
blue 	= vectorf([0,0,1])

trainSet = TrainingSetF()
trainSet.AddInput(black)
trainSet.AddInput(white)
trainSet.AddInput(red)
trainSet.AddInput(green)
trainSet.AddInput(blue)

widthMap = 5
heightMap = 1

inpWidth = 3
inpHeight = 1

SOM = SOMNetGaussF(inpWidth,inpHeight,widthMap,heightMap)
SOM.SetTrainingSet(trainSet)
SOM.SetLearningRate(0.75)
SOM.Training(100)

# gets an ordered list of different centroids with the ID of the corresponding BMU
centroids = SOM.GetCentroidList()

# output for fun
for i in centroids:
	print (i)

The output of the demo would be:

White
> 1
> 1
> 1

Red
> 1
> 7.2e-23
> 7.2e-23

Black
> 9.6e-18
> 1.4e-45
> 3.0e-18

Blue
> 2.9e-22
> 1.4e-45
> 1

Usage (C++)

I prepared working examples for many typical use cases. These examples can be found in the "/src/examples" folder. In the following I write a bit about these examples, to help you understand the layout of the library.

1. Back propagation networks

Define the input, e.g. numbers from 0 to 9 could be declared.

float fInp1[56] = { 0,0,0,0,1,0,0,
		    0,0,0,1,1,0,0,
		    0,0,1,0,1,0,0,
		    0,1,0,0,1,0,0,
		    0,0,0,0,1,0,0,
		    0,0,0,0,1,0,0,
		    0,0,0,0,1,0,0,
		    0,0,0,0,1,0,0 };
					
float fInp2[56] = { 0,0,0,1,1,0,0,
		    0,0,1,0,0,1,0,
		    0,1,0,0,0,1,0,
		    0,1,0,0,1,0,0,
		    0,0,0,1,0,0,0,
		    0,0,0,1,0,0,0,
		    0,0,1,0,0,0,0,
		    0,1,1,1,1,1,0 };
					
// ..

AddLayer connects all neurons of both layers with each other. It is also possible to create networks by defining each connection. This can be done by a vector describing the graph. Internally the networks works as a linked list (cpu). For the gpu implementation, a vector is created based on the connection graph.

#include <ANNet>
#include <ANContainers>
#include <ANMath>

#include <Samples.h>

#include <ctime>
#include <iostream>


int main(int argc, char *argv[]) {
	ANN::BPNet<float, ANN::fcn_log<float>> cpu_one;
        
	ANN::BPLayer<float, ANN::fcn_log<float>> *layer1 = cpu_one.AddLayer(56, ANN::ANLayerInput);
	ANN::BPLayer<float, ANN::fcn_log<float>> *layer2 = cpu_one.AddLayer(64, ANN::ANLayerHidden);
	ANN::BPLayer<float, ANN::fcn_log<float>> *layer3 = cpu_one.AddLayer(9, ANN::ANLayerOutput);

	layer1->ConnectLayer(layer2);
	layer2->ConnectLayer(layer3);
	
	ANN::TrainingSet<float> input;
	input.AddInput(fInp1, 56);
	input.AddOutput(fOut1, 9);
	// .. more input/output samples
	
	std::vector<float> errors;
	
	ANN::HebbianConf<float> conf = {0.5, 0, 0};
	cpu_one.Setup(conf);
	cpu_one.SetTrainingSet(input);

	bool b = false;
	float f;
	errors = cpu_one.TrainFromData(50, 0, b, f);
	std::cout<< &cpu_one <<std::endl;

	cpu_one.ExpToFS("foo.bar");
	ANN::BPNet<float, ANN::fcn_log<float>> cpu_two;
	cpu_two.ImpFromFS("foo.bar");
	cpu_two.SetTrainingSet(input);
	
	std::cout<< &cpu_two <<std::endl;
	return 0;
}

HebbianConf is a small struct storing the learning rates and related constants of the network. A zero will automatically disable the related procedure like the momentum term, or weight decay during training.

template <class T>
struct HebbianConf {
	T learning_rate;
	T momentum_rate;
	T weight_decay;
};

2. Self organizing maps (SOMs)

1. CPU implementation

Self-organizing maps (SOM) are a type of network that is trained using unsupervised learning to produce a low-dimensional (typically two-dimensional), representation of training samples. SOMs are different from other networks in the sense that they use a neighborhood function for learning. The example shows a SOM 128x128 node network, each node can process a 3x1 input vector.

ANN::SOMNet<float, ANN::functor_gaussian<float>> cpu;
cpu.CreateSOM(3, 1, 128,128);
cpu.SetTrainingSet(input);
cpu.Training(100, ANN::ANSerialMode);

2. GPGPU implementation

It is possible to speed up the calculation of SOMs by processing them on GPU. One can train, save and load the net with CPU as well and later continue with GPU and the other way round, simply by calling the CTOR:

ANN::SOMNet<float, ANN::functor_gaussian<float>> cpu;
cpu.CreateSOM(3, 1, 128, 128); // input w, h, net w, h
ANN::SOMNetGPU<float, ANN::functor_gaussian<float>> gpu(&cpu);      // use copy CTOR or create it like above

// do stuff
// ..

cpu.ExpToFS(foo.bar);
gpu.ImpFromFS(foo.bar);

3. GPGPU <-> CPU comparison

Figure 1: Training progress on quad core CPU (A) and GTX 570 (B) with three dimensional vectors as input. Both maps have the same size and the computation time was fixed. In the same time the GPU implementation was able to process more iterations, thus the classification of the input vectors resulted in a lower error.

General class extensions

You certainly know that in C++ it is not possible to add methods to an existing class. So how that's possible? The trick is to include in the declaration of a class a file defined by the preprocessor token e.g. "__ConTable_ADDON". You can extent any class and most files like this. The token always have a "__" prefix followed by the class name and the "_ADDON" suffix. Classes/Structs have such injection points always in the "public" space.

template <class T>
struct ConTable {
	// ..
	std::vector<T> m_vMisc;
	
#ifdef __ConTable_ADDON
	#include __ConTable_ADDON
#endif
};

For this we just need to create a new header file, e.g. "foo.h" with the example content:

std::vector<T> m_vYourVector; // vector for your addon
void your_new_function() { }  // your custom function ..

Then we just need to pass the path of this file to the build system. In CMake, this would look like this.

add_definitions(-D__ConTable_ADDON="${SOM_GPU_ADDON_SOURCE_DIR}/foo.h")

Here, "SOM_GPU_ADDON_SOURCE_DIR" is the path of the file "foo.h"

Adding custom learning/distance functions

Adding new functions for the CPU based library

As long as you work with the CPU implementation it is super simple to define your own functions and pass them to the related functor.

/*
 * Here we declare just the basic learning functions 
 */
template <class T>
inline T custom_learn(T fWeight, T fInfluence, T fInput) {
	return fWeight + (fInfluence*(fInput-fWeight) );
}

template <class T>
inline T custom_gaussian_nhood (T dist, T sigmaT) {
	return exp(-pow(dist, 2.f)/(2.f*pow(sigmaT, 2.f)));
}

template <class T>
inline T custom_rad_decay (T sigma0, T t, T lambda) {
	return std::floor(sigma0*exp(-t/lambda) + 0.5f);
}

template <class T>
inline T custom_lrate_decay (T sigma0, T t, T lambda) {
	return sigma0*exp(-t/lambda);
}

/*
 * Here we define the functor for the network 
 */
template<class T> using custom_functor = ANN::DistFunction<T, custom_learn<T>, custom_gaussian_nhood<T>, custom_rad_decay<T>, custom_lrate_decay<T> >;

int main(int argc, char *argv[]) {
	// ..
	ANN::SOMNetCPU<float, custom_functor<float> > cpu;
	// ..

Adding new functions for the GPU based library

Adding new functions/functors for the GPU implementation is not as simple. The NVCC requires instantiation of the class "SOMNetGPU", because not all of the class implementation can be shifted into a regular header file (device code). This means we need to create an instance of our "extended" class right when the library is build.

For this we can make use of the general class extensions. To achieve the same as illustrated in the previous example. We define a file "NewFunctions.h":

template <class T>
inline T __host__ __device__ custom_learn(T fWeight, T fInfluence, T fInput) {
	return fWeight + (fInfluence*(fInput-fWeight) );
}

template <class T>
inline T __host__ __device__ custom_gaussian_nhood (T dist, T sigmaT) {
	return exp(-pow(dist, 2.f)/(2.f*pow(sigmaT, 2.f)));
}

template <class T>
inline T __host__ __device__ custom_rad_decay (T sigma0, T t, T lambda) {
	return std::floor(sigma0*exp(-t/lambda) + 0.5f);
}

template <class T>
inline T __host__ __device__ custom_lrate_decay (T sigma0, T t, T lambda) {
	return sigma0*exp(-t/lambda);
}

template<class T> using custom_functor = ANN::DistFunction<T, custom_learn<T>, custom_gaussian_nhood<T>, custom_rad_decay<T>, custom_lrate_decay<T> >;

.. and a file "NewInstances.h":

template ANNGPGPU::SOMNetGPU<float, custom_functor<float> >::SOMNetGPU();
template ANNGPGPU::SOMNetGPU<double, custom_functor<double> >::SOMNetGPU();
// ..

then we adapt the CMake build file and add the following lines:

# here we pass the extension headers to the build system
add_definitions(-D__Functions_ADDONS="${SOM_GPU_ADDON_SOURCE_DIR}/NewFunctions.h")
add_definitions(-D__SOMNetGPU_INSTANCES="${SOM_GPU_ADDON_SOURCE_DIR}/NewInstances.h")

Advanced implementation guide

Usually a network consists of nodes and edges. Most implementations of neural networks store their data in a array. Calculation is simple with this approach, but asymmetric networks get hard to implement and new functionality is hard to add. To create more complex variants of networks it makes sense to put the information of the network in the edges and the functionality to calculate parts of the network into the nodes. This makes it easier to include new functionality and to re-use old code. Because information flow in neuronal networks is often directed, container classes help to keep an order. The network class on the other hand calls learning or training functions and implements the principal learning procedure, e.g. switching training patterns or break the learning procedure if a certain error limit was hit.

To make the long story short, the three important classes to derive from are:

AbsNeuron
AbsLayer
AbsNet

AbsNeuron

In the library some functions of the abstract base classes are meant to get implemented in the derived class. Here I give a small overview about them.

virtual void AdaptEdges()       = 0;
virtual void CalcValue()        = 0;

In CalcValue() you calculate the data you want to store in the neuron. Every neuron (or node) in the network has a list of edges which direct to neurons of another (or the same) layer. This example shows you how to run through this list to implement a neuron in a back propagation network.

template <class Type, class Functor>
void BPNeuron<Type, Functor>::CalcValue() {
	if(this->GetConsI().size() == 0) {
		return;
	}

	Type val = 0;
	for(unsigned int i = 0; i < this->GetConsI().size(); i++) {
		AbsNeuron<Type> *from = this->GetConI(i)->GetDestination(this);
		val += from->GetValue() * this->GetConI(i)->GetValue();
	}
	this->SetValue(val);

	val = Functor::transfer( this->GetValue(), 0.f );
	this->SetValue(val);
}

The algorithm to adapt the edges is implemented in AdaptEdges(). Again we use the internal list to run through all edges (outgoing ones) the neuron is connected with.

template <class Type, class Functor>
void BPNeuron<Type, Functor>::AdaptEdges() {
	if(this->GetConsO().size() == 0)
		return;

	AbsNeuron<Type> *pCurNeuron;
	Edge<Type> 	*pCurEdge;
	Type 		val;

	// calc error deltas
	val = this->GetErrorDelta();
	for(unsigned int i = 0; i < this->GetConsO().size(); i++) {
		pCurEdge 	= this->GetConO(i);
		pCurNeuron 	= pCurEdge->GetDestination(this);
		val += pCurNeuron->GetErrorDelta() * pCurEdge->GetValue();
	}
	
	val *= Functor::derivate( this->GetValue(), 0.f );
	this->SetErrorDelta(val);

	// adapt weights
	for(unsigned int i = 0; i < this->GetConsO().size(); i++) {
		pCurEdge = this->GetConO(i);
		if(pCurEdge->GetAdaptationState() == true) {
			val = Functor::learn( 	this->GetValue(), 
						pCurEdge->GetValue(), 
						pCurEdge->GetMomentum(),
						pCurEdge->GetDestination(this)->GetErrorDelta(),
						m_Setup );
			
			pCurEdge->SetMomentum( val );
			pCurEdge->SetValue( val+pCurEdge->GetValue() );
		}
	}
}

AbsLayer

Neurons are stored in layers. If you decide to write your own layer class, then you have to implement the Resize() function. This could be useful especially if you have strange layer topologies (e.g. 2-dimensional or 3-dimensional).

template <class Type, class Functor>
void BPLayer<Type, Functor>::Resize(const unsigned int &iSize) {
	this->EraseAll();
	this->AddNeurons(iSize);
}

AbsNet

The last class you may want to derive from, is AbsNet. Here are three functions you have to overload: PropagateFW() and PropagateBW().

virtual void PropagateFW() = 0;
virtual void PropagateBW() = 0;

Here I show, how to implement these functions in a back propagation network.

template <class Type, class Functor>
void BPNet<Type, Functor>::PropagateFW() {
	for(unsigned int i = 1; i < this->m_lLayers.size(); i++) {
		BPLayer<Type, Functor> *curLayer = ((BPLayer<Type, Functor>*)this->GetLayer(i) );
		//#pragma omp parallel for
		for(int j = 0; j < static_cast<int>(curLayer->GetNeurons().size() ); j++) {
			curLayer->GetNeuron(j)->CalcValue();
		}
	}
}

template <class Type, class Functor>
void BPNet<Type, Functor>::PropagateBW() {
	for(int i = this->m_lLayers.size()-1; i >= 0; i--) {
		BPLayer<Type, Functor> *curLayer = ( (BPLayer<Type, Functor>*)this->GetLayer(i) );
		//#pragma omp parallel for
		for(int j = 0; j < static_cast<int>( curLayer->GetNeurons().size() ); j++) {
			curLayer->GetNeuron(j)->AdaptEdges();
		}
	}
}

Different Implementations use different types of layers. This is why you may want to overload AddLayer().

template <class Type, class Functor>
void BPNet<Type, Functor>::AddLayer(const unsigned int &iSize, const LayerTypeFlag &flType) {
	this->AddLayer( new BPLayer<Type, Functor>(iSize, flType, -1) );
}

Save and load your custom class data to file-system

Example: Back propagation networks

If you decide to create your own e.g. Layer class, then you probably want to add features which require to be stored on the hdd. For this the ExpToFS() and ImpFromFS() functions are required to get modified. This works more or less like reimplementing virtual functions in Qt. Calling the virtual base class ensures to save the base class contents. The following example shows how the freshly inserted integer storing the layer depth in a back propagation network can be saved and recovered from the hdd.

template <class Type, class Functor>
void BPLayer<Type, Functor>::ExpToFS(BZFILE* bz2out, int iBZ2Error) {
	std::cout<<"Save BPLayer to FS()"<<std::endl;
	AbsLayer<Type>::ExpToFS(bz2out, iBZ2Error);

	int iZLayer = m_iZLayer;
	BZ2_bzWrite( &iBZ2Error, bz2out, &iZLayer, sizeof(int) );
}

Now the other way round, we load the content from the file-system.

template <class Type, class Functor>
int BPLayer<Type, Functor>::ImpFromFS(BZFILE* bz2in, int iBZ2Error, ConTable<Type> &Table) {
	std::cout<<"Load BPLayer from FS()"<<std::endl;
	int iLayerID = AbsLayer<Type>::ImpFromFS(bz2in, iBZ2Error, Table);

	int iZLayer = -1;
	BZ2_bzRead( &iBZ2Error, bz2in, &iZLayer, sizeof(int) );
	Table.ZValOfLayer.push_back(iZLayer);

	return iLayerID;
}

The last function which has to be implemented is CreateNet(). Here the content loaded from the file system is used to create a copy of the net in the memory. The base implementation creates the layers and the connections of the network, so we just have to implement the bias neuron.

template <class Type, class Functor>
void BPNet<Type, Functor>::CreateNet(const ConTable<Type> &Net) {
	std::cout<<"Create BPNet"<<std::endl;

	/*
	* Init
	*/
	unsigned int iDstNeurID = 0;
	unsigned int iDstLayerID = 0;
	unsigned int iSrcLayerID = 0;

	Type fEdgeValue = 0.f;

	AbsLayer<Type> *pDstLayer = NULL;
	AbsLayer<Type> *pSrcLayer = NULL;
	AbsNeuron<Type> *pDstNeur = NULL;
	AbsNeuron<Type> *pSrcNeur = NULL;

	/*
	* For all nets necessary: Create Connections (Edges)
	*/
	AbsNet<Type>::CreateNet(Net);

	/*
	* Support z-layers
	*/
	for(unsigned int i = 0; i < this->m_lLayers.size(); i++) {
		BPLayer<Type, Functor> *curLayer = ((BPLayer<Type, Functor>*)this->GetLayer(i) );
		curLayer->SetZLayer(Net.ZValOfLayer[i]);
	}
}

Example: Self organizing maps

Here is another example for SOMs. Only the import of the positions has to be added.

template<class Type, class Functor>
void SOMNet<Type, Functor>::CreateNet(const ConTable<Type> &Net) {
	std::cout<<"Create SOMNet"<<std::endl;

	/*
	* For all nets necessary: Create Connections (Edges)
	*/
	AbsNet<Type>::CreateNet(Net);

	/*
	* Set Positions
	*/
	for(unsigned int i = 0; i < Net.Neurons.size(); i++) {
		int iLayerID 	= Net.Neurons.at(i).m_iLayerID;
		int iNeurID 	= Net.Neurons.at(i).m_iNeurID;
		
		// Get position
		int iPosSize = Net.Neurons.at(i).m_vMisc.at(0);
		std::vector<Type> vPos(iPosSize);
		for(int j = 0; j < iPosSize; j++) {
			vPos[j] = Net.Neurons.at(i).m_vMisc[1+j];
		}
		
		// Save other information of the neuron
		ANN::SOMNeuron<Type> *pNeuron = (ANN::SOMNeuron<Type> *)this->GetLayer(iLayerID)->GetNeuron(iNeurID);
		pNeuron->SetPosition(vPos);
		pNeuron->SetLearningRate(Net.Neurons.at(i).m_vMisc[iPosSize+1]);
		pNeuron->SetSigma0(Net.Neurons.at(i).m_vMisc[iPosSize+2]);
	}
}

annetgpgpu's People

Contributors

Stargazers

Watchers

annetgpgpu's Issues

SOM training comparison on CPU and GPU

Demos: Designer

Figure 2: This shows how a basic Frontend implementation with Qt may look like. In the example just back propagation networks are used.

Vimeo: Application Demo

GetCentroidList() and GetCentrOInpList() return inconsistent data.

Lines 409 to 412 in SOMNet.cpp do the following:

// Count the number of centroids std::sort(vCentroids.begin(), vCentroids.end() ); vCentroids.erase(std::unique(vCentroids.begin(), vCentroids.end()), vCentroids.end() ); std::cout<<"Number of clusters found: "<<vCentroids.size()<<std::endl;

Which, of course, messes up both the order of the centroids (by sorting) and removes centroids with the same values. I don't think the resulting vCentroids has any meaning after this processing and makes the behaviour of GetCentroidList() and GetCentrOInpList() inconsistent!

I think this is probably just a bug and these lines should be removed, or vCentroids should be copied into a new var before the centroid "count".

I am trying to use the ANPyNetCPU library. Building works:

/ > cd /var/tmp
/var/tmp > git clone https://github.com/ANNetGPGPU/ANNetGPGPU
Klone nach 'ANNetGPGPU' ...
remote: Counting objects: 1455, done.
remote: Total 1455 (delta 0), reused 0 (delta 0), pack-reused 1455
Empfange Objekte: 100% (1455/1455), 1.01 MiB | 740.00 KiB/s, Fertig.
Löse Unterschiede auf: 100% (952/952), Fertig.
Prüfe Konnektivität ... Fertig.
/var/tmp > cd ANNetGPGPU/
/var/tmp/ANNetGPGPU > sh build.sh 
-- The C compiler identification is GNU 5.3.1
-- The CXX compiler identification is GNU 5.3.1
-- Check for working C compiler: /usr/bin/cc
-- Check for working C compiler: /usr/bin/cc -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Check for working CXX compiler: /usr/bin/c++
-- Check for working CXX compiler: /usr/bin/c++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Found Doxygen: /usr/bin/doxygen (found version "1.8.10") 
-- Found BZip2: /usr/lib64/libbz2.so (found version "1.0.6") 
-- Looking for BZ2_bzCompressInit in /usr/lib64/libbz2.so
-- Looking for BZ2_bzCompressInit in /usr/lib64/libbz2.so - found
-- Try OpenMP C flag = [-fopenmp]
-- Performing Test OpenMP_FLAG_DETECTED
-- Performing Test OpenMP_FLAG_DETECTED - Success
-- Try OpenMP CXX flag = [-fopenmp]
-- Performing Test OpenMP_FLAG_DETECTED
-- Performing Test OpenMP_FLAG_DETECTED - Success
-- Found OpenMP: -fopenmp  
CUDA_TOOLKIT_ROOT_DIR not found or specified
-- Could NOT find CUDA (missing:  CUDA_TOOLKIT_ROOT_DIR CUDA_NVCC_EXECUTABLE CUDA_INCLUDE_DIRS CUDA_CUDART_LIBRARY) 
-- CUDA must be found before CMake looks for Thrust!
-- Found SWIG: /usr/bin/swig (found version "3.0.7") 
-- Found PythonLibs: /usr/lib64/libpython3.4m.so (found version "3.4.2") 
-- Try OpenMP C flag = [-fopenmp]
-- Performing Test OpenMP_FLAG_DETECTED
-- Performing Test OpenMP_FLAG_DETECTED - Success
-- Try OpenMP CXX flag = [-fopenmp]
-- Performing Test OpenMP_FLAG_DETECTED
-- Performing Test OpenMP_FLAG_DETECTED - Success
-- Found unsuitable Qt version "" from NOTFOUND
CUDA_TOOLKIT_ROOT_DIR not found or specified
-- Could NOT find CUDA (missing:  CUDA_TOOLKIT_ROOT_DIR CUDA_NVCC_EXECUTABLE CUDA_INCLUDE_DIRS CUDA_CUDART_LIBRARY) 
-- Configuring done
-- Generating done
-- Build files have been written to: /var/tmp/ANNetGPGPU/build
Scanning dependencies of target ANNet
[  3%] Building CXX object src/CMakeFiles/ANNet.dir/2DArray.cpp.o
[  7%] Building CXX object src/CMakeFiles/ANNet.dir/3DArray.cpp.o
[ 11%] Building CXX object src/CMakeFiles/ANNet.dir/AbsLayer.cpp.o
[ 14%] Building CXX object src/CMakeFiles/ANNet.dir/AbsNet.cpp.o
[ 18%] Building CXX object src/CMakeFiles/ANNet.dir/AbsNeuron.cpp.o
[ 22%] Building CXX object src/CMakeFiles/ANNet.dir/BPLayer.cpp.o
[ 25%] Building CXX object src/CMakeFiles/ANNet.dir/BPNet.cpp.o
[ 29%] Building CXX object src/CMakeFiles/ANNet.dir/BPNeuron.cpp.o
[ 33%] Building CXX object src/CMakeFiles/ANNet.dir/Edge.cpp.o
[ 37%] Building CXX object src/CMakeFiles/ANNet.dir/Functions.cpp.o
[ 40%] Building CXX object src/CMakeFiles/ANNet.dir/HFLayer.cpp.o
[ 44%] Building CXX object src/CMakeFiles/ANNet.dir/HFNet.cpp.o
[ 48%] Building CXX object src/CMakeFiles/ANNet.dir/HFNeuron.cpp.o
[ 51%] Building CXX object src/CMakeFiles/ANNet.dir/SOMLayer.cpp.o
[ 55%] Building CXX object src/CMakeFiles/ANNet.dir/SOMNet.cpp.o
[ 59%] Building CXX object src/CMakeFiles/ANNet.dir/SOMNeuron.cpp.o
[ 62%] Building CXX object src/CMakeFiles/ANNet.dir/TrainingSet.cpp.o
[ 66%] Linking CXX static library libANNet.a
[ 66%] Built target ANNet
[ 70%] Swig source
Scanning dependencies of target _ANPyNetCPU
[ 74%] Building CXX object src/CMakeFiles/_ANPyNetCPU.dir/PyNetCPUPYTHON_wrap.cxx.o
[ 77%] Linking CXX shared module _ANPyNetCPU.so
[ 77%] Built target _ANPyNetCPU
Scanning dependencies of target BPNetCPU
[ 81%] Building CXX object examples/misc/CMakeFiles/BPNetCPU.dir/BPNetCPU.cpp.o
[ 85%] Linking CXX executable BPNetCPU
[ 85%] Built target BPNetCPU
Scanning dependencies of target HFNet
[ 88%] Building CXX object examples/misc/CMakeFiles/HFNet.dir/HFNet.cpp.o
[ 92%] Linking CXX executable HFNet
[ 92%] Built target HFNet
Scanning dependencies of target LoadBPNet
[ 96%] Building CXX object examples/misc/CMakeFiles/LoadBPNet.dir/LoadBPNet.cpp.o
[100%] Linking CXX executable LoadBPNet
[100%] Built target LoadBPNet
/var/tmp/ANNetGPGPU >

The Python module seems to be built:

/var/tmp/ANNetGPGPU > ls -l /var/tmp/ANNetGPGPU/build/src/
insgesamt 5036
-rw-rw-r--. 1 build build   83576 15. Mär 12:08 ANPyNetCPU.py
-rwxrwxr-x. 1 build build 1901152 15. Mär 12:08 _ANPyNetCPU.so
drwxrwxr-x. 4 build build    4096 15. Mär 12:08 CMakeFiles
-rw-rw-r--. 1 build build     978 15. Mär 12:08 cmake_install.cmake
drwxrwxr-x. 2 build build    4096 15. Mär 12:08 include
-rw-rw-r--. 1 build build 1498262 15. Mär 12:08 libANNet.a
-rw-rw-r--. 1 build build   19590 15. Mär 12:08 Makefile
-rw-rw-r--. 1 build build 1634028 15. Mär 12:08 PyNetCPUPYTHON_wrap.cxx

However, loading the module in Python results in an error:

/var/tmp/ANNetGPGPU > cd /var/tmp/ANNetGPGPU/build/src
/var/tmp/ANNetGPGPU/build/src > python
Python 2.7.10 (default, Sep 24 2015, 17:50:09) 
[GCC 5.1.1 20150618 (Red Hat 5.1.1-4)] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> import ANPyNetCPU
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "ANPyNetCPU.py", line 34, in <module>
    _ANPyNetCPU = swig_import_helper()
  File "ANPyNetCPU.py", line 30, in swig_import_helper
    _mod = imp.load_module('_ANPyNetCPU', fp, pathname, description)
ImportError: dynamic module does not define init function (init_ANPyNetCPU)
>>>

OS is:
Fedora 22, AMD64

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.