GitHub - g7n3/gorgonia: Gorgonia is a library that helps facilitate machine learning in Go.

#Gorgonia #

Gorgonia is a library that helps facilitate machine learning in Go. Write and evaluate mathematical equations involving multidimensional arrays easily. If this sounds like Theano or TensorFlow, it's because the idea is quite similar. Specifically, the library is pretty low-level, like Theano, but has higher goals like Tensorflow.

Gorgonia:

Can perform automatic differentiation
Can perform symbolic differentiation
Can perform gradient descent optimizations
Can perform numerical stabilization
Provides a number of convenience functions to help create neural networks
Is fairly quick (comparable to Theano and Tensorflow's speed)
Supports CUDA/GPGPU computation (OpenCL not yet supported, send a pull request)
Will support distributed computing

#Why Use Gorgonia?#

The main reason to use Gorgonia is developer comfort. If you're using a Go stack extensively, now you have access to the ability to create production-ready machine learning systems in an environment that you are already familiar and comfortable with.

ML/AI at large is usually split into two stages: the experimental stage where one builds various models, test and retest; and the deployed state where a model after being tested and played with, is deployed. This necessitate different roles like data scientist and data engineer.

Typically the two phases have different tools: Python/Lua (using Theano, Torch, etc) is commonly used for the experimental stage, and then the model is rewritten in some more performant language like C++ (using dlib, mlpack etc). Of course, nowadays the gap is closing and people frequently share the tools between them. Tensorflow is one such tool that bridges the gap.

Gorgonia aims to do the same, but for the Go environment. Gorgonia is currently fairly performant - its speeds are comparable to Theano's and Tensorflow's (official benchmarks haven't yet been done because of an existing CUDA bug in Gorgonia; and also the implementations may differ slightly so an exact like-for-like model is hard to compare).

#Installation #

The package is go-gettable: go get -u github.com/chewxy/gorgonia.

There are very few dependencies that Gorgonia uses - and they're all pretty stable, so as of now there isn't a need for vendoring tools. These are the list of external packages that Gorgonia calls, ranked in order of reliance that this package has (subpackages are omitted):

Package	Used For	Vitality	Notes	Licence
gonum/graph	Sorting `*ExprGraph`	Vital. Removal means Gorgonia will not work	Development of Gorgonia is committed to keeping up with the most updated version	gonum license (MIT/BSD-like)
gonum/blas	Tensor subpackage linear algebra operations	Vital. Removal means Gorgonial will not work	Development of Gorgonia is committed to keeping up with the most updated version	gonum license (MIT/BSD-like)
cu	CUDA drivers	Needed for CUDA operations	Same maintainer as Gorgonia	MIT/BSD-like
math32	`float32` operations	Can be replaced by `float32(math.XXX(float64(x)))`	Same maintainer as Gorgonia, same API as the built in `math` package	MIT/BSD-like
hm	Type system for Gorgonia	Gorgonia's graphs are pretty tightly coupled with the type system	Same maintainer as Gorgonia	MIT/BSD-like
vecf64	optimized `[]float64` operations	Can be generated in the `tensor/genlib` package. However, plenty of optimizations have been made/will be made	Same maintainer as Gorgonia	MIT/BSD-like
vecf32	optimized `[]float32` operations	Can be generated in the `tensor/genlib` package. However, plenty of optimizations have been made/will be made	Same maintainer as Gorgonia	MIT/BSD-like
set	Various set operations	Can be easily replaced	Stable API for the past 1 year	set licence (MIT/BSD-like)
gographviz	Used for printing graphs	Graph printing is only vital to debugging. Gorgonia can survive without, but with a major (but arguably nonvital) feature loss	Stable API for the past 1 year	gographviz licence (Apache 2.0)
rng	Used to implement helper functions to generate initial weights	Can be replaced fairly easily. Gorgonia can do without the convenience functions too		rng licence (Apache 2.0)
errors	Error wrapping	Gorgonia won't die without it. In fact Gorgonia has also used goerrors/errors in the past.	Stable API for the past 6 months	errors licence (MIT/BSD-like)
gonum/matrix	Compatibility between `Tensor` and Gonum's Matrix	Development of Gorgonia is committed to keeping up with the most updated version	gonum license (MIT/BSD-like)
testify/assert	Testing	Can do without but will be a massive pain in the ass to test		testify licence (MIT/BSD-like)

#Keeping Updated#

Gorgonia's project has a mailing list, as well as a Twitter account. Official updates and announcements will be posted to those two sites.

#Usage#

Gorgonia works by creating a computation graph, and then executing it. Think of it as a programming language, but is limited to mathematical functions. In fact this is the dominant paradigm that the user should be used to thinking about. The computation graph is an AST.

Microsoft's CNTK, with its BrainScript, is perhaps the best at exemplifying the idea that building of a computation graph and running of the computation graphs are different things, and that the user should be in different modes of thoughts when going about them.

Whilst Gorgonia's implementation doesn't enforce the separation of thought as far as CNTK's BrainScript does, the syntax does help a little bit.

Here's an example - say you want to define a math expression z = x + y. Here's how you'd do it:

package main

import (
	"fmt"
	"log"

	. "github.com/chewxy/gorgonia"
)

func main() {
	g := NewGraph()

	var x, y, z *Node
	var err error

	// define the expression
	x = NewScalar(g, Float64, WithName("x"))
	y = NewScalar(g, Float64, WithName("y"))
	z, err = Add(x, y)
	if err != nil {
		log.Fatal(err)
	}

	// compile into a program
	prog, locMap, err := Compile(g)
	if err != nil {
		log.Fatal(err)
	}

	// create a VM to run the program on
	machine := NewTapeMachine(prog, locMap)

	// set initial values then run
	Let(x, 2.0)
	Let(y, 2.5)
	if machine.RunAll() != nil {
		log.Fatal(err)
	}

	fmt.Printf("%v", z.Value())
	// Output: 4.5
}

You might note that it's a little more verbose than other packages of similar nature. For example, instead of compiling to a callable function, Gorgonia specifically compiles into a *program which requires a *TapeMachine to run. It also requires manual a Let(...) call.

The author would like to contend that this is a Good Thing - to shift one's thinking to a machine-based thinking. It helps a lot in figuring out where things might go wrong.

###VMs###

There are two VMs in the current version of Gorgonia:

TapeMachine
LispMachine

They function differently and take different inputs. The TapeMachine is useful for executing expressions that are generally static (that is to say the computation graph does not change). Due to its static nature, the TapeMachine is good for running expressions that are compiled-once-run-many-times (such as linear regression, SVM and the like).

The LispMachine on the other hand was designed to take a graph as an input, and executes directly on the nodes of the graph. If the graph change, simply create a new lightweight LispMachine to execute it on. The LispMachine is suitable for tasks such as creating recurrent neural networks without a fixed size.

Prior to release of Gorgonia, there was a third VM - a stack based VM that is similar to TapeMachine but deals with artificial gradients better. It may see light of day again, once this author has managed to fix all the kinks.

##Differentiation##

Gorgonia performs both symbolic and automatic differentiation. There are subtle differences between the two processes. The author has found that it's best to think of it this way - Automatic differentiation is differentiation that happens at runtime, concurrently with the execution of the graph, while symbolic differentiation is differentiation that happens during the compilation phase.

Runtime of course, refers to the execution of the expression graph, not the program's actual runtime.

With the introduction to the two VMs, it's easy to see how Gorgonia can perform both symbolic and automatic differentiation. Using the same example as above, the reader should note that there was no differentiation done. Instead, let's try with a LispMachine:

package main

import (
	"fmt"
	"log"

	. "github.com/chewxy/gorgonia"
)

func main() {
	g := NewGraph()

	var x, y, z *Node
	var err error

	// define the expression
	x = NewScalar(g, Float64, WithName("x"))
	y = NewScalar(g, Float64, WithName("y"))
	z, err = Add(x, y)
	if err != nil {
		log.Fatal(err)
	}

	// set initial values then run
	Let(x, 2.0)
	Let(y, 2.5)

	// by default, LispMachine performs forward mode and backwards mode execution
	m := NewLispMachine(g)
	if m.RunAll() != nil {
		log.Fatal(err)
	}

	fmt.Printf("z: %v\n", z.Value())

	xgrad, err := x.Grad()
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("dz/dx: %v\n", xgrad)

	ygrad, err := y.Grad()
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("dz/dy: %v\n", ygrad)

	// Output:
	// z: 4.5
	// dz/dx: 1
	// dz/dy: 1
}

Of course, Gorgonia also supports the more traditional symbolic differentiation like in Theano:

package main

import (
	"fmt"
	"log"

	. "github.com/chewxy/gorgonia"
)

func main() {
	g := NewGraph()

	var x, y, z *Node
	var err error

	// define the expression
	x = NewScalar(g, Float64, WithName("x"))
	y = NewScalar(g, Float64, WithName("y"))
	z, err = Add(x, y)
	if err != nil {
		log.Fatal(err)
	}

	// symbolically differentiate z with regards to x and y
	// this adds the gradient nodes to the graph g
	var grads Nodes
	grads, err = Grad(z, x, y)
	if err != nil {
		log.Fatal(err)
	}

	// compile into a program
	prog, locMap, err := Compile(g)
	if err != nil {
		log.Fatal(err)
	}

	// create a VM to run the program on
	machine := NewTapeMachine(prog, locMap)

	// set initial values then run
	Let(x, 2.0)
	Let(y, 2.5)
	if machine.RunAll() != nil {
		log.Fatal(err)
	}

	fmt.Printf("z: %v\n", z.Value())

	xgrad, err := x.Grad()
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("dz/dx: %v | %v\n", xgrad, grads[0])

	ygrad, err := y.Grad()
	if err != nil {
		log.Fatal(err)
	}
	fmt.Printf("dz/dy: %v | %v\n", ygrad, grads[1])

	// Output:
	// z: 4.5
	// dz/dx: 1 | 1
	// dz/dy: 1 | 1
}

Currently Gorgonia only performs backwards mode automatic differentiation (aka backpropagation), although one may observe the vestiges of an older version which supported forwards mode differentiation in the existence of *dualValue. It may return in the future.

##Graph##

A lot has been said about a computation graph or an expression graph. But what is it exactly? Think of it as an AST for the math expression that you want. Here's the graph for the examples (but with a vector and a scalar addition instead) above:

By the way, Gorgonia comes with nice-ish graph printing abilities. Here's an example of a graph of the equation y = x² and its derivation:

To read the graph is easy. The expression builds from bottom up, while the derivations build from top down. This way the derivative of each node is roughly on the same level.

Red-outlined nodes indicate that it's a root node. Green outlined nodes indicate that they're a leaf node. Nodes with a yellow background indicate that it's an input node. The dotted arrows indicate which node is the gradient node for the pointed-to node.

Concretely, it says that c42011e840 (dy/dx) is the gradient node of the input c42011e000 (which is x).

###Node Rendering###

A Node is rendered thusly:

ID	node name :: type
OP*	op name :: type
shape
compilation metadata
Value†	Gradient

###Additional Notes###

If it's an input node, then the Op row will not show up.
If there are no Values bound to the node, it will show up as NIL. However, when there are values and gradients, it will try to as best as possible display the values bound to the node.

##Using CUDA ##

Gorgonia comes with CUDA support out of the box. However, usage is specialized. To use CUDA, you must build your application with the build tag cuda, like so:

go build -tags='cuda' .

Furthermore, there are some additional requirements:

CUDA toolkit 8.0 is required. Installing this installs the nvcc compiler which is required to run your code with CUDA
go install github.com/chewxy/gorgonia/cmd/cudagen. This installs the cudagen program. Running cudagen will generate the relevant CUDA related code for Gorgonia.
The CUDA ops must be manually enabled in your code with the UseCudaFor option.
runtime.LockOSThread() must be called in the main function where the VM is running. CUDA requires thread affinity, and therefore the OS thread must be locked.

###Example ###

So how do we use CUDA? Say we've got a file, main.go:

import (
	"fmt"
	"log"
	"runtime"

	T "github.com/chewxy/gorgonia"
	"github.com/chewxy/gorgonia/tensor"
)

func main() {
	g := T.NewGraph()
	x := T.NewMatrix(g, T.Float32, T.WithName("x"), T.WithShape(100, 100))
	y := T.NewMatrix(g, T.Float32, T.WithName("y"), T.WithShape(100, 100))
	xpy := T.Must(T.Add(x, y))
	xpy2 := T.Must(T.Tanh(xpy))

	prog, locMap, _ := T.Compile(g)
	m := T.NewTapeMachine(prog, locMap, T.UseCudaFor("tanh"))

	T.Let(x, tensor.New(tensor.WithShape(100, 100), tensor.WithBacking(tensor.Random(tensor.Float32, 100*100))))
	T.Let(y, tensor.New(tensor.WithShape(100, 100), tensor.WithBacking(tensor.Random(tensor.Float32, 100*100))))

	runtime.LockOSThread()
	for i := 0; i < 1000; i++ {
		if err := m.RunAll(); err != nil {
			log.Fatalf("iteration: %d. Err: %v", i, err)
		}
	}
	runtime.UnlockOSThread()

	fmt.Printf("%1.1f", xpy2.Value())
}

If this is run normally:

go run main.go

CUDA will not be used.

If the program is to be run using CUDA, then this must be invoked:

go run -tags='cuda'

And even so, only the tanh function uses CUDA.

###Rationale ###

The main reasons for having such complicated requirements for using CUDA is quite simply performance related. As Dave Cheney famously wrote, cgo is not Go. To use CUDA, cgo is unfortunately required. And to use cgo, plenty of tradeoffs need to be made.

Therefore the solution was to nestle the CUDA related code in a build tag, cuda. That way by default no cgo is used (well, kind-of - you could still use cblas or blase).

The reason for requiring CUDA toolkit 8.0 is because there are many CUDA Compute Capabilities, and generating code for them all would yield a huge binary for no real good reason. Rather, users are encouraged to compile for their specific Compute Capabilities.

Lastly, the reason for requiring an explicit specification to use CUDA for which ops is due to the cost of cgo calls. Additional work is being done currently to implement batched cgo calls, but until that is done, the solution is keyhole "upgrade" of certain ops

###Ops supported by CUDA###

As of now, only the very basic simple ops support CUDA:

Elementwise unary operations:

abs
sin
cos
exp
ln
log2
neg
square
sqrt
inv (reciprocal of a number)
cube
tanh
sigmoid
log1p
expm1
softplus

Elementwise binary operations - only arithmetic operations support CUDA:

add
sub
mul
div
pow

From a lot of profiling of this author's personal projects, the ones that really matter are tanh, sigmoid, expm1, exp and cube - basically the activation functions. The other operations do work fine with MKL+AVX and aren't the major cause of slowness in a neural network

###CUDA improvements ###

In a trivial benchmark, careful use of CUDA (in this case, used to call sigmoid) shows impressive improvements over non-CUDA code (bearing in mind the CUDA kernel is extremely naive and not optimized):

BenchmarkOneMilCUDA-8   	     300	   3348711 ns/op
BenchmarkOneMil-8       	      50	  33169036 ns/op

#API Stability# Gorgonia's API is as of right now, not considered stable. It will be stable from version 1.0 forwards.

1.0 is defined by when the test coverage hits 90%, and the relevant Tensor methods have been completed.

#Roadmap#

Here are the goals for Gorgonia, sorted by importance

#Goals# The primary goal for Gorgonia is to be a highly performant machine learning/graph computation-based library that can scale across multiple machines. It should bring the appeal of Go (simple compilation and deployment process) to the ML world. It's a long way from there currently, however, the baby steps are already there.

The secondary goal for Gorgonia is to provide a platform for exploration for non-standard deep-learning and neural network related things. This includes things like neo-hebbian learning, corner-cutting algorithms, evolutionary algorithms and the like.

#Contributing#

Obviously since you are most probably reading this on Github, Github will form the major part of the workflow for contributing to this package.

Name		Name	Last commit message	Last commit date
Latest commit History 308 Commits
.travis/linux		.travis/linux
blase		blase
cmd/cudagen		cmd/cudagen
cuda modules		cuda modules
examples		examples
media		media
tensor		tensor
.gitignore		.gitignore
.travis.yml		.travis.yml
CONTRIBUTING.md		CONTRIBUTING.md
CONTRIBUTORS.md		CONTRIBUTORS.md
DEVNOTES.md		DEVNOTES.md
LICENSE		LICENSE
README.md		README.md
analysis.go		analysis.go
bench_typesystem_test.go		bench_typesystem_test.go
blas.go		blas.go
broadcast.go		broadcast.go
broadcast_test.go		broadcast_test.go
collections.go		collections.go
collections_test.go		collections_test.go
compile.go		compile.go
const.go		const.go
cuda.go		cuda.go
cuda_test.go		cuda_test.go
debug.go		debug.go
device.go		device.go
device_cuda.go		device_cuda.go
differentiation.go		differentiation.go
differentiation_test.go		differentiation_test.go
doc.go		doc.go
dual.go		dual.go
dual_test.go		dual_test.go
equalities.go		equalities.go
equalities_test.go		equalities_test.go
errors.go		errors.go
example_autodiff_test.go		example_autodiff_test.go
example_basic_test.go		example_basic_test.go
example_linearregression_test.go		example_linearregression_test.go
example_symdiff_test.go		example_symdiff_test.go
formatter.go		formatter.go
formatter_test.go		formatter_test.go
gorgonia.go		gorgonia.go
gorgonia_test.go		gorgonia_test.go
graph.go		graph.go
graph_test.go		graph_test.go
math.go		math.go
math_fast.go		math_fast.go
math_nooptim.go		math_nooptim.go
nn.go		nn.go
nn_test.go		nn_test.go
node.go		node.go
node_set.go		node_set.go
node_test.go		node_test.go
noextern.go		noextern.go
noextern_test.go		noextern_test.go
op.go		op.go
op_infidel.go		op_infidel.go
op_math.go		op_math.go
op_math_cuda.go		op_math_cuda.go
op_math_cuda_test.go		op_math_cuda_test.go
op_math_noextern.go		op_math_noextern.go
op_math_test.go		op_math_test.go
op_nn.go		op_nn.go
op_reduction.go		op_reduction.go
op_reduction_test.go		op_reduction_test.go
op_tensor.go		op_tensor.go
op_tensor_test.go		op_tensor_test.go
op_test.go		op_test.go
operations.go		operations.go
operations_test.go		operations_test.go
operatorLinAlg.go		operatorLinAlg.go
operatorLinAlg_const.go		operatorLinAlg_const.go
operatorPointwise_binary.go		operatorPointwise_binary.go
operatorPointwise_binary_const.go		operatorPointwise_binary_const.go
operatorPointwise_binary_test.go		operatorPointwise_binary_test.go
operatorPointwise_unary.go		operatorPointwise_unary.go
operatorPointwise_unary_const.go		operatorPointwise_unary_const.go
operatorPointwise_unary_test.go		operatorPointwise_unary_test.go
opt.go		opt.go
perf.go		perf.go
perf_test.go		perf_test.go
regalloc.go		regalloc.go
regalloc_test.go		regalloc_test.go
release.go		release.go
shape.go		shape.go
slice.go		slice.go
solvers.go		solvers.go
solvers_test.go		solvers_test.go
stabilization.go		stabilization.go
stabilization_test.go		stabilization_test.go
templates.go		templates.go
testsetup_test.go		testsetup_test.go
type.go		type.go
typeSystem.go		typeSystem.go
typeSystem_test.go		typeSystem_test.go
type_test.go		type_test.go
utils.go		utils.go
values.go		values.go
values_cuda.go		values_cuda.go

Source	How it's Used	Licence
Numpy	Inspired large portions. Directly adapted algorithms for a few methods (explicitly labelled in the docs)	MIT/BSD-like. Numpy Licence
Theano	Inspired large portions. (Unsure: number of directly adapted algorithms)	MIT/BSD-like Theano's licence

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

License

g7n3/gorgonia

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages