Commit 903f1c0

added requirements and preparations for version 0.1.0
1 parent 9bf904b commit 903f1c0

5 files changed, 64 additions and 5 deletions

CHANGELOG.md

Lines changed: 10 additions & 0 deletions
@@ -3,14 +3,24 @@ All notable changes to this project will be documented in this file.
 This project adheres to [Semantic Versioning](http://semver.org/).

 ## [Unreleased]
+
+
+
+
+## [0.1.0] - 2016-11-02
 ### Changed
 - verbose now also prints debug output when correctness check fails
+- restructured the utility functions into util and core
 - restructured the code to prepare for different strategies
 - shortened the output printed by the tune_kernel
 - allowing numpy integers for specifying problem size

 ### Added
+- a public roadmap
+- requirements.txt
+- example showing GPU code unit testing with the Kernel Tuner
 - support for passing a (list of) filenames instead of kernel string
+- runner that takes a random sample of 10 percent
 - support for OpenCL platform selection
 - support for using tuning parameter names in the problem size
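The changelog mentions a runner that takes a random sample of 10 percent. The commit does not show that runner, so the following is only a minimal sketch of the idea, assuming the search space is the Cartesian product of the tuning parameter values. The function name `sample_configurations`, its arguments, and the `tile_size` parameter are hypothetical and not taken from the Kernel Tuner source.

```python
import itertools
import random


def sample_configurations(tune_params, fraction=0.1, seed=None):
    """Return a random subset of the Cartesian product of tuning parameters.

    tune_params maps a parameter name to the list of values to try.
    Hypothetical helper for illustration; not the Kernel Tuner runner.
    """
    names = list(tune_params)
    space = list(itertools.product(*(tune_params[n] for n in names)))
    # Keep at least one configuration so there is always something to benchmark.
    count = max(1, int(len(space) * fraction))
    rng = random.Random(seed)
    # random.sample draws without replacement, so no configuration repeats.
    chosen = rng.sample(space, count)
    return [dict(zip(names, values)) for values in chosen]


tune_params = {
    "block_size_x": [32, 64, 128, 256, 512],
    "block_size_y": [1, 2, 4, 8],
    "tile_size": [1, 2, 4, 8, 16],  # assumed parameter name
}
configs = sample_configurations(tune_params, fraction=0.1, seed=0)
print(len(configs), "of", 5 * 4 * 5, "configurations selected")
```

Sampling without replacement keeps the benchmark budget at a fixed share of the full space, at the cost of possibly missing the best configuration.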

doc/source/conf.py

Lines changed: 2 additions & 2 deletions
@@ -57,9 +57,9 @@
 # built documents.
 #
 # The short X.Y version.
-version = u'0.0.1'
+version = u'0.1.0'
 # The full version, including alpha/beta/rc tags.
-release = u'0.0.1'
+release = u'0.1.0'

 # The language for content autogenerated by Sphinx. Refer to documentation
 # for a list of supported languages.
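This commit bumps the version string by hand in both doc/source/conf.py and setup.py. A small consistency check along the following lines could catch a missed bump. It is a sketch and not part of the commit: `find_versions` is a hypothetical helper, and the two strings stand in for the relevant file contents.

```python
import re


def find_versions(text):
    """Collect quoted X.Y.Z values assigned to `version` or `release`."""
    pattern = r"""^\s*(?:version|release)\s*=\s*u?["'](\d+\.\d+\.\d+)["']"""
    return set(re.findall(pattern, text, flags=re.MULTILINE))


# Stand-ins for the file contents after this commit.
conf_py = "version = u'0.1.0'\nrelease = u'0.1.0'\n"
setup_py = 'setup(\n    name = "kernel_tuner",\n    version = "0.1.0",\n)\n'

versions = find_versions(conf_py) | find_versions(setup_py)
consistent = len(versions) == 1
print("versions found:", sorted(versions), "consistent:", consistent)
```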

requirements.txt

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+mock>=2.0.0
+nose>=1.3.7
+numpy>=1.7.1
+pycuda>=2016.1.1
+pyopencl>=2015.2.4
+

roadmap.md

Lines changed: 38 additions & 0 deletions
@@ -0,0 +1,38 @@
+# Roadmap for the Kernel Tuner
+
+This roadmap presents an overview of the features we are currently planning to
+implement. Please note that this is a living document that will evolve as
+priorities grow and shift.
+
+### version 0.2.0
+
+This is the list of features that we want to have implemented by the next version.
+
+ * Option to store tuning results in a file (e.g. json, csv, ... )
+ * Option to set a function that performs output verification, instead of numpy.allclose()
+ * Option to change defaults for 'block_size_x', and so on
+ * Option to set a function that computes search space restriction, instead of a list of strings
+ * Option to set compiler name, when using C backend
+ * Option to set compiler options
+
+### version 1.0.0
+
+These features are to be implemented by version 1.0.0, but may already be
+implemented in earlier versions.
+
+ * Tuning kernels in parallel on a single node
+ * Tuning kernels in parallel on a set of nodes in a GPU cluster
+ * Tuning kernels using machine learning or search strategies
+ * Store tuning results in a database and provide an API for analysis
+
+### Low priority
+
+These are the things that we would like to implement, but we currently have no
+demand for them. If you are interested in any of these, let us know!
+
+ * Tuning compiler options in combination with other kernel parameters
+ * Example that tunes a kernel using thread block re-indexing
+ * Example host code that runs a pipeline of kernels
+ * Example CUDA host code that uses runtime compilation
+
+
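Two of the 0.2.0 roadmap items, a restriction function in place of a list of strings and storing tuning results in a file, can be illustrated together. The roadmap does not specify any API, so this is only a sketch under assumptions: `restricted_space`, `block_fits`, and the `time_ms` field are hypothetical names, and the timings are placeholders rather than measurements.

```python
import itertools
import json


def restricted_space(tune_params, restriction):
    """Yield each configuration for which restriction(config) is true."""
    names = list(tune_params)
    for values in itertools.product(*(tune_params[n] for n in names)):
        config = dict(zip(names, values))
        if restriction(config):
            yield config


tune_params = {"block_size_x": [16, 32, 64], "block_size_y": [1, 2, 4, 8]}


# Hypothetical restriction: keep thread blocks of 64 to 256 threads.
def block_fits(config):
    threads = config["block_size_x"] * config["block_size_y"]
    return 64 <= threads <= 256


configs = list(restricted_space(tune_params, block_fits))

# Placeholder timings; real values would come from benchmarking each kernel.
results = [dict(config, time_ms=None) for config in configs]
serialized = json.dumps(results, indent=2, sort_keys=True)
print(len(configs), "configurations pass the restriction")
```

A callable can express conditions that are awkward to write as strings, such as ones that depend on device properties, while JSON keeps the results readable by other tools.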

setup.py

Lines changed: 8 additions & 3 deletions
@@ -1,17 +1,20 @@
 import os
 from setuptools import setup

+with open('requirements.txt') as f:
+    required = f.read().splitlines()
+
 def read(fname):
     return open(os.path.join(os.path.dirname(__file__), fname)).read()

 setup(
     name = "kernel_tuner",
-    version = "0.0.1",
+    version = "0.1.0",
     author = "Ben van Werkhoven",
     author_email = "[email protected]",
-    description = ("A simple CUDA kernel tuner in Python"),
+    description = ("A simple CUDA/OpenCL kernel tuner in Python"),
     license = "Apache 2.0",
-    keywords = "auto-tuning gpu pycuda cuda pyopencl opencl",
+    keywords = "auto-tuning gpu computing pycuda cuda pyopencl opencl",
     url = "http://benvanwerkhoven.github.io/kernel_tuner/",
     packages=['kernel_tuner', 'kernel_tuner.runners'],
     long_description=read('README.md'),
@@ -30,5 +33,7 @@ def read(fname):
         'Topic :: System :: Distributed Computing',
         'Development Status :: 4 - Beta',
     ],
+    install_requires=required,
+
 )
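The new setup.py passes `f.read().splitlines()` straight to `install_requires`. `splitlines()` keeps blank lines as empty strings, and the requirements.txt hunk appears to end with one. A slightly more defensive reader might look like the sketch below. It is not part of the commit, `read_requirements` is a hypothetical helper, and whether the empty entry causes any problem in practice is not established here.

```python
import os
import tempfile


def read_requirements(path):
    """Return requirement specifiers, skipping blank lines and comments."""
    with open(path) as handle:
        lines = (line.strip() for line in handle)
        return [line for line in lines if line and not line.startswith("#")]


# Demonstration on a temporary file that ends with a blank line.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "requirements.txt")
    with open(path, "w") as handle:
        handle.write("# test and runtime dependencies\n")
        handle.write("numpy>=1.7.1\npycuda>=2016.1.1\n\n")
    required = read_requirements(path)

print(required)
```

In setup.py itself, the path could be built from `os.path.dirname(__file__)`, as the existing `read` helper does, so that the file is found regardless of the current working directory.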
