Commit df217b2

Merge branch 'master' into synchronization
* master: (5 commits) Deprecate timeout ...
2 parents 3dc0e87 + 8266271 commit df217b2

8 files changed: 205 additions, 114 deletions

README.md

Lines changed: 10 additions & 10 deletions
@@ -199,22 +199,22 @@ any platform. *Documentation is forthcoming...*
 
 ```
 *MRI only*
-rake build:native       # Build concurrent-ruby-ext-<version>-<platform>.gem into the pkg directory
-rake compile:extension  # Compile extension
+bundle exec rake build:native       # Build concurrent-ruby-ext-<version>-<platform>.gem into the pkg dir
+bundle exec rake compile:extension  # Compile extension
 
 *JRuby only*
-rake build       # Build JRuby-specific core gem (alias for `build:core`)
-rake build:core  # Build concurrent-ruby-<version>-java.gem into the pkg directory
+bundle exec rake build       # Build JRuby-specific core gem (alias for `build:core`)
+bundle exec rake build:core  # Build concurrent-ruby-<version>-java.gem into the pkg directory
 
 *All except JRuby*
-rake build       # Build core and extension gems
-rake build:core  # Build concurrent-ruby-<version>.gem into the pkg directory
-rake build:ext   # Build concurrent-ruby-ext-<version>.gem into the pkg directory
+bundle exec rake build       # Build core and extension gems
+bundle exec rake build:core  # Build concurrent-ruby-<version>.gem into the pkg directory
+bundle exec rake build:ext   # Build concurrent-ruby-ext-<version>.gem into the pkg directory
 
 *All*
-rake clean    # Remove any temporary products
-rake clobber  # Remove any generated file
-rake compile  # Compile all the extensions
+bundle exec rake clean    # Remove any temporary products
+bundle exec rake clobber  # Remove any generated file
+bundle exec rake compile  # Compile all the extensions
 ```
 
 ## Maintainers

doc/tvar.md

Lines changed: 78 additions & 27 deletions
@@ -1,10 +1,10 @@
-`TVar` and `atomically` implement a software transactional memory. A `TVar` is a single
-item container that always contains exactly one value. The `atomically` method
-allows you to modify a set of `TVar` objects with the guarantee that all of the
-updates are collectively atomic - they either all happen or none of them do -
-consistent - a `TVar` will never enter an illegal state - and isolated - atomic
-blocks never interfere with each other when they are running. You may recognise
-these properties from database transactions.
+`TVar` and `atomically` implement a software transactional memory. A `TVar` is a
+single item container that always contains exactly one value. The `atomically`
+method allows you to modify a set of `TVar` objects with the guarantee that all
+of the updates are collectively atomic - they either all happen or none of them
+do - consistent - a `TVar` will never enter an illegal state - and isolated -
+atomic blocks never interfere with each other when they are running. You may
+recognise these properties from database transactions.
 
 There are some very important and unusual semantics that you must be aware of:
 
@@ -24,7 +24,10 @@ We implement nested transactions by flattening.
 We only support strong isolation if you use the API correctly. In order words,
 we do not support strong isolation.
 
-Our implementation uses a very simple two-phased locking with versioned locks algorithm, as per [1]. In the future we will look at more advanced algorithms, contention management and using existing Java implementations when in JRuby.
+Our implementation uses a very simple two-phased locking with versioned locks
+algorithm and lazy writes, as per [1]. In the future we will look at more
+advanced algorithms, contention management and using existing Java
+implementations when in JRuby.
 
 See:
 
@@ -150,46 +153,94 @@ repeated execution.
 
 ## Evaluation
 
-We evaluated the performance of our `TVar` implementation using a bank account simulation with a range of synchronisation implementations. The simulation maintains a set of bank account totals, and runs transactions that either get a summary statement of multiple accounts (a read-only operation) or transfers a sum from one account to another (a read-write operation).
-
-We implemented a bank that does not use any synchronisation (and so creates inconsistent totals in accounts), one that uses a single global (or 'coarse') lock (and so won't scale at all), one that uses one lock per account (and so has a complicated system for locking in the correct order) and one using our `TVar` and `atomically`.
-
-We ran 1 million transactions divided equally between a varying number of threads on a system that has at least that many physical cores. The transactions are made up of a varying mixture of read-only and read-write transactions. We ran each set of transactions thirty times, discarding the first ten and then taking an algebraic mean. These graphs show only the simple mean. Our `tvars-experiments` branch includes the benchmark used, full details of the test system, and all the raw data.
-
-Using JRuby using 75% read-write transactions, we can compare how the different implementations of bank accounts scales to more cores. That is, how much faster it runs if you use more cores.
+We evaluated the performance of our `TVar` implementation using a bank account
+simulation with a range of synchronisation implementations. The simulation
+maintains a set of bank account totals, and runs transactions that either get a
+summary statement of multiple accounts (a read-only operation) or transfers a
+sum from one account to another (a read-write operation).
+
+We implemented a bank that does not use any synchronisation (and so creates
+inconsistent totals in accounts), one that uses a single global (or 'coarse')
+lock (and so won't scale at all), one that uses one lock per account (and so has
+a complicated system for locking in the correct order) and one using our `TVar`
+and `atomically`.
+
+We ran 1 million transactions divided equally between a varying number of
+threads on a system that has at least that many physical cores. The transactions
+are made up of a varying mixture of read-only and read-write transactions. We
+ran each set of transactions thirty times, discarding the first ten and then
+taking an algebraic mean. These graphs show only the simple mean. Our `tvars-
+experiments` branch includes the benchmark used, full details of the test
+system, and all the raw data.
+
+Using JRuby using 75% read-write transactions, we can compare how the different
+implementations of bank accounts scales to more cores. That is, how much faster
+it runs if you use more cores.
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/implementation-scalability.png)
 
-We see that the coarse lock implementation does not scale at all, and in fact with more cores only wastes more time in contention for the single global lock. We see that the unsynchronised implementation doesn't seem to scale well - which is strange as there should be no overhead, but we'll explain that in a second. We see that the fine lock implementation seems to scale better, and that the `TVar` implementation scales the best.
+We see that the coarse lock implementation does not scale at all, and in fact
+with more cores only wastes more time in contention for the single global lock.
+We see that the unsynchronised implementation doesn't seem to scale well - which
+is strange as there should be no overhead, but we'll explain that in a second.
+We see that the fine lock implementation seems to scale better, and that the
+`TVar` implementation scales the best.
 
 So the `TVar` implementation *scales* very well, but how absolutely fast is it?
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/implementation-absolute.png)
 
-Well, that's the downside. The unsynchronised implementation doesn't scale well because it's so fast in the first place, and probably because we're bound on access to the memory - the threads don't have much work to do, so no matter how many threads we have the system is almost always reaching out to the L3 cache or main memory. However remember that the unsynchronised implementation isn't correct - the totals are wrong at the end. The coarse lock implementation has an overhead of locking and unlocking. The fine lock implementation has a greater overhead as as the locking scheme is complicated to avoid deadlock. It scales better, however, actually allowing transactions to be processed in parallel. The `TVar` implementation has a greater overhead still - and it's pretty huge. That overhead is the cost for the simple programming model of an atomic block.
-
-So that's what `TVar` gives you at the moment - great scalability, but it has a high overhead. That's pretty much the state of software transactional memory in general. Perhaps hardware transactional memory will help us, or perhaps we're happy anyway with the simpler and safer programming model that the `TVar` gives us.
-
-We can also use this experiment to compare different implementations of Ruby. We looked at just the `TVar` implementation and compared MRI 2.1.1, Rubinius 2.2.6, and JRuby 1.7.11, again at 75% write transactions.
+Well, that's the downside. The unsynchronised implementation doesn't scale well
+because it's so fast in the first place, and probably because we're bound on
+access to the memory - the threads don't have much work to do, so no matter how
+many threads we have the system is almost always reaching out to the L3 cache or
+main memory. However remember that the unsynchronised implementation isn't
+correct - the totals are wrong at the end. The coarse lock implementation has an
+overhead of locking and unlocking. The fine lock implementation has a greater
+overhead as as the locking scheme is complicated to avoid deadlock. It scales
+better, however, actually allowing transactions to be processed in parallel. The
+`TVar` implementation has a greater overhead still - and it's pretty huge. That
+overhead is the cost for the simple programming model of an atomic block.
+
+So that's what `TVar` gives you at the moment - great scalability, but it has a
+high overhead. That's pretty much the state of software transactional memory in
+general. Perhaps hardware transactional memory will help us, or perhaps we're
+happy anyway with the simpler and safer programming model that the `TVar` gives
+us.
+
+We can also use this experiment to compare different implementations of Ruby. We
+looked at just the `TVar` implementation and compared MRI 2.1.1, Rubinius 2.2.6,
+and JRuby 1.7.11, again at 75% write transactions.
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/ruby-scalability.png)
 
-We see that MRI provides no scalability, due to the global interpreter lock (GIL). JRuby seems to scale better than Rubinius for this workload (there are of course other workloads).
+We see that MRI provides no scalability, due to the global interpreter lock
+(GIL). JRuby seems to scale better than Rubinius for this workload (there are of
+course other workloads).
 
-As before we should also look at the absolute performance, not just the scalability.
+As before we should also look at the absolute performance, not just the
+scalability.
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/ruby-absolute.png)
 
-Again, JRuby seems to be faster than Rubinius for this experiment. Interestingly, Rubinius looks slower than MRI for 1 core, but we can get around that by using more cores.
+Again, JRuby seems to be faster than Rubinius for this experiment.
+Interestingly, Rubinius looks slower than MRI for 1 core, but we can get around
+that by using more cores.
 
-We've used 75% read-write transactions throughout. We'll just take a quick look at how the scalability varies for different workloads, for scaling between 1 and 2 threads. We'll admit that we used 75% read-write just because it emphasised the differences.
+We've used 75% read-write transactions throughout. We'll just take a quick look
+at how the scalability varies for different workloads, for scaling between 1 and
+2 threads. We'll admit that we used 75% read-write just because it emphasised
+the differences.
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/implementation-write-proportion-scalability.png)
 
-Finally, we can also run on a larger machine. We repeated the experiment using a machine with 64 physical cores and JRuby.
+Finally, we can also run on a larger machine. We repeated the experiment using a
+machine with 64 physical cores and JRuby.
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/implementation-scalability.png)
 
 ![](https://raw.githubusercontent.com/ruby-concurrency/concurrent-ruby/master/doc/images/tvar/implementation-absolute.png)
 
-Here you can see that `TVar` does become absolutely faster than using a global lock, at the slightly ridiculously thread-count of 50. It's probably not statistically significant anyway.
+Here you can see that `TVar` does become absolutely faster than using a global
+lock, at the slightly ridiculously thread-count of 50. It's probably not
+statistically significant anyway.
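The transfer and summary transactions described in this evaluation might look roughly like the following (a hypothetical sketch only, not the benchmark code from the `tvars-experiments` branch; the account count and amounts are invented):

```ruby
require 'concurrent'

accounts = Array.new(1_000) { Concurrent::TVar.new(100) }

# Read-write transaction: move a sum between two accounts atomically.
def transfer(accounts, from, to, amount)
  Concurrent::atomically do
    accounts[from].value = accounts[from].value - amount
    accounts[to].value   = accounts[to].value + amount
  end
end

# Read-only transaction: a consistent summary over several accounts.
def summary(accounts, indices)
  Concurrent::atomically do
    indices.inject(0) { |sum, i| sum + accounts[i].value }
  end
end

transfer(accounts, 0, 1, 10)
puts summary(accounts, [0, 1])  #=> 200 - the combined total never drifts
```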

lib/concurrent/agent.rb

Lines changed: 16 additions & 5 deletions
@@ -2,7 +2,6 @@
 
 require 'concurrent/dereferenceable'
 require 'concurrent/observable'
-require 'concurrent/utility/timeout'
 require 'concurrent/logging'
 
 module Concurrent
@@ -64,6 +63,7 @@ def rescue(clazz = StandardError, &block)
       end
       self
     end
+
     alias_method :catch, :rescue
     alias_method :on_error, :rescue
 
@@ -87,6 +87,7 @@ def validate(&block)
       end
       self
     end
+
     alias_method :validates, :validate
     alias_method :validate_with, :validate
     alias_method :validates_with, :validate
@@ -106,20 +107,30 @@ def post(&block)
     # Update the current value with the result of the given block fast,
     # block can do blocking calls
     #
-    # @param [Fixnum, nil] timeout maximum number of seconds before an update is cancelled
+    # @param [Fixnum, nil] timeout [DEPRECATED] maximum number of seconds before an update is cancelled
     #
     # @yield the fast to be performed with the current value in order to calculate
     #   the new value
     # @yieldparam [Object] value the current value
     # @yieldreturn [Object] the new value
     # @return [true, nil] nil when no block is given
     def post_off(timeout = nil, &block)
-      block = if timeout
-                lambda { |value| Concurrent::timeout(timeout) { block.call(value) } }
+      warn '[DEPRECATED] post_off with timeout options is deprecated and will be removed'
+      task = if timeout
+               lambda do |value|
+                 future = Future.execute do
+                   block.call(value)
+                 end
+                 if future.wait(timeout)
+                   future.value!
+                 else
+                   raise Concurrent::TimeoutError
+                 end
+               end
              else
                block
             end
-      post_on(@io_executor, &block)
+      post_on(@io_executor, &task)
    end
 
    # Update the current value with the result of the given block fast,
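A hedged usage sketch of the `Agent#post_off` path this hunk rewrites (the agent's initial value, the increment block and the two-second timeout are all illustrative):

```ruby
require 'concurrent'

agent = Concurrent::Agent.new(0)

# Without a timeout the behaviour is unchanged: the block is queued to run
# later on the agent's IO executor.
agent.post_off { |value| value + 1 }

# The timeout option is now deprecated: post_off warns and, instead of a
# thread being killed, the block is wrapped in a Future that raises
# Concurrent::TimeoutError if it misses the two-second deadline.
agent.post_off(2) { |value| value + 1 }

sleep(0.1)  # crude wait for the background updates in this sketch
puts agent.value
```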

lib/concurrent/tvar.rb

Lines changed: 15 additions & 23 deletions
@@ -153,37 +153,36 @@ class Transaction
     ABORTED = Object.new
 
     ReadLogEntry = Struct.new(:tvar, :version)
-    UndoLogEntry = Struct.new(:tvar, :value)
 
     AbortError = Class.new(StandardError)
 
     def initialize
-      @write_set = Set.new
       @read_log = []
-      @undo_log = []
+      @write_log = {}
     end
 
     def read(tvar)
       Concurrent::abort_transaction unless valid?
-      @read_log.push(ReadLogEntry.new(tvar, tvar.unsafe_version))
-      tvar.unsafe_value
+
+      if @write_log.has_key? tvar
+        @write_log[tvar]
+      else
+        @read_log.push(ReadLogEntry.new(tvar, tvar.unsafe_version))
+        tvar.unsafe_value
+      end
     end
 
     def write(tvar, value)
       # Have we already written to this TVar?
 
-      unless @write_set.include? tvar
+      unless @write_log.has_key? tvar
         # Try to lock the TVar
 
         unless tvar.unsafe_lock.try_lock
           # Someone else is writing to this TVar - abort
           Concurrent::abort_transaction
         end
 
-        # We've locked it - add it to the write set
-
-        @write_set.add(tvar)
-
         # If we previously wrote to it, check the version hasn't changed
 
         @read_log.each do |log_entry|
@@ -193,27 +192,20 @@ def write(tvar, value)
         end
       end
 
-      # Record the current value of the TVar so we can undo it later
+      # Record the value written
 
-      @undo_log.push(UndoLogEntry.new(tvar, tvar.unsafe_value))
-
-      # Write the new value to the TVar
-
-      tvar.unsafe_value = value
+      @write_log[tvar] = value
     end
 
     def abort
-      @undo_log.each do |entry|
-        entry.tvar.unsafe_value = entry.value
-      end
-
       unlock
     end
 
     def commit
       return false unless valid?
 
-      @write_set.each do |tvar|
+      @write_log.each_pair do |tvar, value|
+        tvar.unsafe_value = value
        tvar.unsafe_increment_version
      end
 
@@ -224,7 +216,7 @@ def commit
 
     def valid?
       @read_log.each do |log_entry|
-        unless @write_set.include? log_entry.tvar
+        unless @write_log.has_key? log_entry.tvar
          if log_entry.tvar.unsafe_version > log_entry.version
            return false
          end
@@ -235,7 +227,7 @@ def valid?
     end
 
     def unlock
-      @write_set.each do |tvar|
+      @write_log.each_key do |tvar|
        tvar.unsafe_lock.unlock
      end
    end
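A small sketch of the behaviour the new write log preserves while deferring the actual mutation to commit time (single-threaded, so the transaction commits on its first attempt):

```ruby
require 'concurrent'

v = Concurrent::TVar.new(0)

Concurrent::atomically do
  v.value = 14
  # The read consults the transaction's write log first, so it sees the
  # pending value even though the underlying TVar is only assigned (and
  # its version incremented) when the transaction commits.
  puts v.value  #=> 14
end

puts v.value    #=> 14, now committed and visible outside the transaction
```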

lib/concurrent/utility/timeout.rb

Lines changed: 10 additions & 13 deletions
@@ -5,12 +5,13 @@
 
 module Concurrent
 
-  # Wait the given number of seconds for the block operation to complete.
+  # [DEPRECATED] Wait the given number of seconds for the block operation to complete.
   # Intended to be a simpler and more reliable replacement to the Ruby
-  # standard library `Timeout::timeout` method.
+  # standard library `Timeout::timeout` method. It does not kill the task
+  # so it finishes anyway. Advantage is that it cannot cause any ugly errors by
+  # killing threads.
   #
   # @param [Integer] seconds The number of seconds to wait
-  #
   # @return [Object] The result of the block operation
   #
   # @raise [Concurrent::TimeoutError] when the block operation does not complete
@@ -19,20 +20,16 @@ module Concurrent
   # @see http://ruby-doc.org/stdlib-2.2.0/libdoc/timeout/rdoc/Timeout.html Ruby Timeout::timeout
   #
   # @!macro monotonic_clock_warning
-  def timeout(seconds)
-
-    thread = Thread.new do
-      Thread.current[:result] = yield
-    end
-    success = thread.join(seconds)
+  def timeout(seconds, &block)
+    warn '[DEPRECATED] timeout is deprecated and will be removed'
 
-    if success
-      return thread[:result]
+    future = Future.execute(&block)
+    future.wait(seconds)
+    if future.complete?
+      future.value!
     else
       raise TimeoutError
     end
-  ensure
-    Thread.kill(thread) unless thread.nil?
   end
   module_function :timeout
 end
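A sketch of what a caller of the deprecated helper sees after this change, next to one possible direct replacement that mirrors the new internals (the five-second sleep and two-second limit are invented for the example):

```ruby
require 'concurrent'

slow_op = lambda { sleep(5); :done }

# Still works, but now emits the deprecation warning and never kills the
# worker: on timeout the block simply keeps running in the background.
begin
  result = Concurrent::timeout(2, &slow_op)
rescue Concurrent::TimeoutError
  result = :fallback
end

# One possible direct replacement, mirroring what the helper now does:
future = Concurrent::Future.execute(&slow_op)
future.wait(2)
result = future.complete? ? future.value! : :fallback
```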
