Skip to content

Commit 5c9e4a0

Browse files
committed
update README.md and FAQ.md
to reflect recent changes and optimizations
1 parent 996d4e4 commit 5c9e4a0

File tree

2 files changed

+13
-5
lines changed

2 files changed

+13
-5
lines changed

FAQ.md

Lines changed: 11 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -13,10 +13,17 @@
1313

1414
## Encoding/decoding is slow in first iteration
1515

16-
Correct. GPUJPEG was intended to provide when running many times (ideally with
17-
a equal-sized pictures, like a video). But using for few or even single image
18-
will not pay off, because there is an initialization burden (let say 230 ms for
19-
a 33 Mpix image).
16+
Correct. This is because the there is initialization of GPUJPEG internal
17+
structures, CUDA buffers, the initialization of GPU execution pipeline
18+
as well as kernel compilation for actual device capability. The last
19+
point can be eliminated by generating code for the particular device
20+
during the compilation:
21+
22+
cmake -DCMAKE_CUDA_ARCHITECTURES=native -DCMAKE_BUILD_TYPE=Release ...
23+
24+
(`all-major` or `all` will also work but the compilation will take longer)
25+
26+
Ideal use case for GPUJPEG is to run for many images (ideally equal-sized).
2027

2128
## What is a restart interval
2229

README.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,8 @@ for high-performance image encoding and decoding. The software runs also on
1212
[ZLUDA.md](ZLUDA.md)).
1313

1414
This documents provides an introduction to the library and how to use it. You
15-
can also look to [FAQ.md](FAQ.md) for additional information. To see _latest changes_
15+
can also look to [FAQ.md](FAQ.md) for _performance tuning_
16+
and additional information. To see _latest changes_
1617
you can display file [NEWS.md](NEWS.md).
1718

1819
Table of contents

0 commit comments

Comments
 (0)