Dilly Dally GET data optimization with BIG DATA - Pagination vs get-everything-at-once

# Problem

We are currently developing a intelligent time registration system (***DillyDally***), where a employee of a given company can log their workhours and the system should create a nice overview of all worklogs. We are hosting our project in Google Firebase, and we are storing all our data in the NoSQL document database Firestore. At average one employee at Prolike submits 5-10 logs per day, it means the data will grow fast. If a company have 1k employees, they will generate approximately 2,6 million logs annually.


### Problem 1: 
The problem is we need to figure out how to fetch ALL worklogs from firestore, and the performance is the core requirement here. We need to analysis to how we can fetch 50k, 500k, 5 millions and 50 millions document smoothly. 


### Problem 2:*
If we dont fetch ALL data at once, how should we be able to calculate the sum of all workhours effectively?

# Features

### A comparison between scenarios measured in time
- Fetching ALL data at once
- Fetching data with pagination ( 100 at once )
_ Your suggestion?

### Benchmark
We should figure out when exactly it doesn't longer pay off to fetch all data at once compared to pagination. 

# Values

With this analysis, our system will now support a large-amount of entrietes, and now we can sell our system to customers.

# Materials

- https://firebase.google.com/docs/firestore/query-data/query-cursors
- https://firebase.google.com/docs/firestore


 

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Dilly Dally GET data optimization with BIG DATA - Pagination vs get-everything-at-once #13

Problem

Problem 1:

Problem 2:*

Features

A comparison between scenarios measured in time

Benchmark

Values

Materials

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Dilly Dally GET data optimization with BIG DATA - Pagination vs get-everything-at-once #13

Description

Problem

Problem 1:

Problem 2:*

Features

A comparison between scenarios measured in time

Benchmark

Values

Materials

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions