Skip to content

Commit 97dd08c

Browse files
Write up changelog for 0.2.0
1 parent e790049 commit 97dd08c

File tree

2 files changed

+74
-5
lines changed

2 files changed

+74
-5
lines changed

docs/tutorial.rst

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -829,7 +829,7 @@ Computing statistics
829829

830830
Tskit provides an extensive and flexible interface for computing population
831831
genetic statistics, which is documented in detail in the :ref:`general statistics
832-
<sec_general_stats>` section. This tutorial aims to give a quick overview of
832+
<sec_stats>` section. This tutorial aims to give a quick overview of
833833
how the APIs work how to use them effectively.
834834

835835
First, lets simulate a tree sequence to work with which has roughly human
@@ -859,7 +859,7 @@ is computed using the :meth:`.TreeSequence.diversity` method::
859859

860860
This tells the average diversity across the whole sequence and returns a single
861861
number. We'll usually want to compute statistics in
862-
:ref:`windows <sec_general_stats_windowing>` along the genome and we
862+
:ref:`windows <sec_stats_windows>` along the genome and we
863863
use the ``windows`` argument to do this::
864864

865865
windows = np.linspace(0, ts.sequence_length, num=5)
@@ -877,7 +877,7 @@ along the genome. Here, we use numpy to create four equally spaced windows
877877
of size 2.5 megabases (the windows array contains k + 1 elements to define
878878
k windows). Because we have asked for values in windows, tskit now returns
879879
a numpy array rather than a single value. (See
880-
:ref:`sec_general_stats_output_dimensions` for a full description of how the output
880+
:ref:`sec_stats_output_dimensions` for a full description of how the output
881881
dimensions of statistics are determined by the ``windows`` argument.)
882882

883883
Suppose we wanted to compute diversity within a specific subset of samples.
@@ -911,7 +911,7 @@ Because we've computed multiple statistics concurrently, tskit returns a numpy a
911911
of these statistics. We have asked for diversity within three different sample sets,
912912
and tskit therefore returns an array with three values. (In general, the
913913
dimensions of the input determines the dimensions of the output: see
914-
:ref:`sec_general_stats_output_dimensions` for a detailed description of the rules.)
914+
:ref:`sec_stats_output_dimensions` for a detailed description of the rules.)
915915

916916
We can also compute multiple statistics in multiple windows::
917917

@@ -1020,7 +1020,7 @@ output as an array with one value if we wish::
10201020

10211021
[0.00039765]
10221022

1023-
Please see :ref:`sec_general_stats_sample_sets` for a
1023+
Please see :ref:`sec_stats_sample_sets` for a
10241024
full description of the ``sample_sets`` and ``indexes`` arguments.
10251025

10261026
.. _sec_tutorial_afs:

python/CHANGELOG.rst

Lines changed: 69 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -13,15 +13,82 @@ In development.
1313
from numpy "u8" to "i8". This should not affect client code in any way
1414
unless it specifically depends on the type of the returned numpy array.
1515

16+
- The VCF written by the ``write_vcf`` is no longer compatible with previous
17+
versions, which had significant shortcomings. Position values are now rounded
18+
to the nearest integer by default, REF and ALT values are derived from the
19+
actual allelic states (rather than always being A and T). Sample names
20+
are now of the form ``tsk_j`` for sample ID j. Most of the legacy behaviour
21+
can be recovered with new options, however.
22+
1623
- The positional parameter ``reference_sets`` in ``genealogical_nearest_neighbours``
1724
and ``mean_descendants`` TreeSequence methods has been renamed to
1825
``sample_sets``.
1926

2027
**New features**
2128

29+
- Support for general windowed statistics. Implementations of diversity,
30+
divergence, segregating sites, Tajima's D, Fst, Patterson's F statistics,
31+
Y statistics, trait correlations and covariance, and k-dimensional allele
32+
frequency specra (:user:`petrelharp`, :user:`jeromekelleher`, :user:`molpopgen`).
33+
2234
- Add the ``keep_unary`` option to simplify (:user:`gtsambos`). See :issue:`1`
2335
and :pr:`143`.
2436

37+
- Add the ``map_ancestors`` method to TableCollection (user:`gtsambos`). See :pr:`175`.
38+
39+
- Add the ``squash`` method to EdgeTable (:user:`gtsambos`). See :issue:`59` and
40+
:pr:`285`.
41+
42+
- Add support for individuals to VCF output, and fix major issues with output
43+
format (:user:`jeromekelleher`). Position values are transformed in a much
44+
more straightforward manner and output has been generalised substantially.
45+
Adds ``individual_names`` and ``position_transform`` arguments.
46+
See :pr:`286`, and issues :issue:`2`, :issue:`30` and :issue:`73`.
47+
48+
- Control height scale in SVG trees using 'tree_height_scale' and 'max_tree_height'
49+
(:user:`hyanwong`, :user:`jeromekelleher`). See :issue:`167`, :pr:`168`.
50+
Various other improvements to tree drawing (:pr:`235`, :pr:`241`, :pr:`242`,
51+
:pr:`252`, :pr:`259`).
52+
53+
- Add ``Tree.max_root_time`` property (:user:`hyanwong`, :user:`jeromekelleher`).
54+
See :pr:`170`.
55+
56+
- Improved input checking on various methods taking numpy arrays as parameters
57+
(:user:`hyanwong`). See :issue:`8` and :pr:`185`.
58+
59+
- Define the branch length over roots in trees to be zero (previously raise
60+
an error; :user:`jeromekelleher`). See :issue:`188` and :pr:`191`.
61+
62+
- Implementation of the genealogical nearest neighbours statistic
63+
(:user:`hyanwong`, :user:`jeromekelleher`).
64+
65+
- New ``delete_intervals`` and ``keep_intervals`` method for the TableCollection
66+
to allow slicing out of topology from specific intervals (:user:`hyanwong`,
67+
:user:`andrewkern`, :user:`petrelharp`, :user:`jeromekelleher`). See
68+
:pr:`225` and :pr:`261`.
69+
70+
- Support for missing data via a topological definition (:user:`jeromekelleher`).
71+
See :issue:`270` and :pr:`272`.
72+
73+
- Add ability to set columns directly in the Tables API (:user:`jeromekelleher`).
74+
See :issue:`12` and :pr:`307`.
75+
76+
- Various documentation improvements from :user:`brianzhang`, :user:`hyanwong`,
77+
:user:`petrelharp` and :user:`jeromekelleher`.
78+
79+
**Deprecated**
80+
81+
- Deprecate ``Tree.length`` in favour of ``Tree.span`` (:user:`hyanwong`).
82+
See :pr:`169`.
83+
84+
- Deprecate ``TreeSequence.pairwise_diversity`` in favour of the new
85+
``diversity`` method. See :issue:`215`, :pr:`312`.
86+
87+
**Bugfixes**
88+
89+
- Catch NaN and infinity values within tables (:user:`hyanwong`).
90+
See :issue:`293` and :pr:`294`.
91+
2592
--------------------
2693
[0.1.5] - 2019-03-27
2794
--------------------
@@ -45,10 +112,12 @@ new ``tskit`` command line interface.
45112

46113
- Fixes to the low-level C API (:issue:`132` and :issue:`157`)
47114

115+
48116
--------------------
49117
[0.1.4] - 2019-02-01
50118
--------------------
51119

120+
52121
Minor feature update. Using the C API 0.99.1.
53122

54123
**New features**

0 commit comments

Comments
 (0)