Commit 646c927

Update roadmap.md
1 parent 89871a2 commit 646c927

1 file changed

+2
-18
lines changed

roadmap.md

Lines changed: 2 additions & 18 deletions
@@ -1,24 +1,8 @@
 # DataFrame Roadmap
 
-## DataFrame-Core
-This package should implement the basic functionality of DataFrame as a collection. It should provide all the basic functionality of data frames in that you can find usually in tools like Pandas and R. Some of this functionality is still missing, for example:
+In this document, I describe the functionality that we want to add to DataFrame in the nearest future. For the specific features that need to be added, please check the [issues](https://github.com/PolyMathOrg/DataFrame/issues).
+
 * Handling missing values
 * Time series
 
 We should find time to go over the documentation of pandas and R and create the complete list of everything we don't have.
-
-Another issue with the Core package is querrying. We have implemented a good interface for queries like SELECT or GROUP BY, but this functionality is incomplete. DataFrame should support all the basic queries we have in SQL. To find out what functionality is missing, we should try reproducing some famous examples from SQL tutorials using DataFrame.
-
-It would be perfect if we had something like LINQ.
-
-## DataFrame-Tools
-Pretty much all the tools in this package need to be improved, fixed, or redesigned.
-
-__GTInspector__
-Right now it shows you a FastTable some basic visualisations. But here is the problem - if you have a DataFrame of textual data, Inspector will try to show you a boxplot in one of its views. I think that DataFrame should automatically detect the kind of data that is stored in it (and not just the classes of values, but also have some "intuition" about the purpose of this data - is it discrete, or does it represent continuous values, like time series? is it a matrix of numbers, for example, a MNIST image, or a list of students with names, grades etc.). When DataFrame has this information about the data, it should change its behaviour accordingly. For example, Inspector must show you statistical data with visualizations (the right visualization could also be detected automatically), textual data in some readable (and searchable / editable) form , images as images etc.
-
-__Visualizations__
-I think we need a tool for data visualizations just as powerful as matplotlib. It should be built on top of Roassal, but be specifically designed for visualizing data. The basic idea is this: if you want a boxplot, you say "Hello, DataFrame, show me a boxplot of your second column", you shouldn't write some long script defining axes, boxes and other things. I have implemented some basic visualizations, but some of them don't work as expected (for example, histogram is not a histogram at all). And this functionality is nowhere near matplotlib or ggplot.
-
-__FastTable__
-What we have right now is more like a demo than an actual tool. It's broken in so many places, for example, last column is not visible, row names are replaced with numbers, big tables turn into big mess etc. Besides, a lot of important functionality should be introduced. Columns should be resizable and sortable. I think we need to create a separate tool which will have all the functionality of Excel tables and allow you to edit the data through it.
