@@ -101,13 +101,12 @@ tabbed to linearly scale to large files.
 Tablib [@tablib], comma [@comma], pandas [@pandas; @mckinney2010] and
 frictionless-py [@frictionless] are popular alternative packages to tabbed.
 \autoref{tbl:table1} compares their respective features. Pandas `read_csv` and
-Frictionless'
-`extract` functions most closely match the available features in tabbed. Both
-support broad type casting and iterative reading of large files. However, both
-require specifying the header row if metadata is written to the file. This per
-file specification of the header location makes batch reading of text files with
-varying structure difficult. Additionally, neither package stores the skipped
-metadata section for later use.
+Frictionless' `extract` functions most closely match the available features in
+tabbed. Both support broad type casting and iterative reading of large files.
+However, both require specifying the header row if metadata is written to the
+file. This per file specification of the header location makes batch reading of
+text files with varying structure difficult. Additionally, neither package
+stores the skipped metadata section for later use.
 
 | **Software** | **Structural Detection** | **Casting** | **Value-based Filtering** | **Iterative** |
 |:----------------:|:------------------------:|:-----------:|:-------------------------:|:-------------:|
@@ -138,7 +137,7 @@ mixed types. The conversion engine for pandas was chosen to be "python". Right:
 Tabbed's read speed as a function of the number of filters applied to each row
 during reading. In both panels black circles and error bars are the mean and
 standard deviation across 30 trials. These comparisons were carried out on
-a single 2.4 GHz Intel Core i5-6300U processor \label{fig:figure2}](figure2.png).
+a single 2.4 GHz Intel Core i5-6300U processor. \label{fig:figure2}](figure2.png)
 
 To further understand tabbed's performance, we tested how row filtering impacts
 read speeds. The right panel of \autoref{fig:figure2} shows the read speed