Store Finder

This repo contains code to find the entities nearest to a given latitude and longitude. The code is designed to keep the location data in memory instead of using a traditional database.

How to use this for other projects?

Import the NearestLocator class from location_base.py.
Write a subclass that inherits from NearestLocator and overrides the process_locations method.
The process_locations method should load your locations (for example from S3 or a file) and pass the latitude and longitude to the find_nearest_locations method, which returns the entities found within the given radius.
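The steps above might look roughly like the sketch below. It is only an illustration: the stand-in NearestLocator, the locations attribute, the method signatures, the CsvStoreLocator name, and the CSV columns are all assumptions made so the example runs on its own. In a real project, import NearestLocator from location_base.py and follow the layout in sample_location_db.csv.

```python
import csv
import io


class NearestLocator:
    """Stand-in for location_base.NearestLocator so this sketch runs alone.

    The real class, its attributes, and its signatures may differ.
    """

    def find_nearest_locations(self, latitude, longitude):
        # Placeholder only: the real method applies the radius search.
        return list(self.locations)

    def process_locations(self, latitude, longitude):
        raise NotImplementedError


# Hypothetical data and column names, used only for this illustration.
SAMPLE_CSV = (
    "name,latitude,longitude\n"
    "Store A,12.97,77.59\n"
    "Store B,13.08,80.27\n"
)


class CsvStoreLocator(NearestLocator):
    def process_locations(self, latitude, longitude):
        # Load locations from any source (file, S3, ...); an in-memory CSV here.
        reader = csv.DictReader(io.StringIO(SAMPLE_CSV))
        self.locations = [
            {
                "name": row["name"],
                "latitude": float(row["latitude"]),
                "longitude": float(row["longitude"]),
            }
            for row in reader
        ]
        # Hand the query point over to the base class search.
        return self.find_nearest_locations(latitude, longitude)


nearest = CsvStoreLocator().process_locations(12.97, 77.59)
```

Only process_locations is overridden here; the radius search itself stays in the base class.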

Note: Sample Location DB structure is available in sample_location_db.csv.

How does the code work?

Based on the required radius, we calculate min_latitude and max_latitude.
We limit the candidates to locations whose latitude falls between those two bounds. To make this filtering fast, the locations are kept sorted by latitude.
We calculate the distance between each remaining candidate and the given location and discard the ones that fall outside the radius. The distance is calculated using the great-circle distance formula.
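The three steps above can be sketched as follows. This is not the code from location_base.py: the function names, the (name, latitude, longitude) tuple layout, the kilometre units, and the choice of the haversine form of the great-circle formula are assumptions made for illustration.

```python
import bisect
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def great_circle_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres (haversine form)."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlambda = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlambda / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))


def find_within_radius(sorted_locations, latitudes, lat, lon, radius_km):
    # Step 1: turn the radius into a latitude band around the query point.
    delta = math.degrees(radius_km / EARTH_RADIUS_KM)
    min_latitude, max_latitude = lat - delta, lat + delta
    # Step 2: binary search on the sorted latitudes to slice out the band.
    lo = bisect.bisect_left(latitudes, min_latitude)
    hi = bisect.bisect_right(latitudes, max_latitude)
    # Step 3: exact distance check on the remaining candidates only.
    return [
        loc for loc in sorted_locations[lo:hi]
        if great_circle_km(lat, lon, loc[1], loc[2]) <= radius_km
    ]


# Hypothetical data: (name, latitude, longitude), sorted once by latitude.
locations = sorted(
    [("A", 12.97, 77.59), ("B", 13.00, 77.60),
     ("C", 12.98, 80.00), ("D", 19.07, 72.87)],
    key=lambda loc: loc[1],
)
latitudes = [loc[1] for loc in locations]

result = find_within_radius(locations, latitudes, 12.97, 77.59, radius_km=10)
```

In this sample, C lies inside the latitude band but is far away in longitude, so only the distance check in step 3 removes it. The band exists to shrink the candidate set cheaply before the more expensive distance calculation runs.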

How to scale the code?

Vertical scaling can be done by spawning multiple processes that share the locations, so the data is not duplicated in memory. This relies on COW (copy-on-write): data is duplicated in a child process only if it is modified, and since our location data is read-only, the processes can share it. An example of this is already implemented in benchmark/benchmark.py.
Horizontal scaling can be done by splitting the data by geography. For example, a single machine can hold the data for just one continent or one country.
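A minimal sketch of the copy-on-write idea, separate from benchmark/benchmark.py: the data is loaded once in the parent, and workers created with the fork start method (available on Unix-like systems only) inherit it without an explicit copy. The LOCATIONS data, count_in_band, and run are hypothetical names used for illustration.

```python
import multiprocessing as mp

# Loaded once in the parent, before any worker is forked.
# Hypothetical data: (id, latitude) pairs.
LOCATIONS = [(i, i / 100) for i in range(10_000)]


def count_in_band(band):
    low, high = band
    # Read-only access to the module-level list inherited from the parent.
    return sum(1 for _, lat in LOCATIONS if low <= lat < high)


def run(bands, workers=2):
    # "fork" makes children share the parent's memory pages copy-on-write.
    ctx = mp.get_context("fork")
    with ctx.Pool(workers) as pool:
        return pool.map(count_in_band, bands)


counts = run([(0, 50), (50, 100)])
```

One caveat: CPython updates reference counts even when objects are only read, so some pages may still end up copied in the children. How much memory is actually shared is best measured on the real data.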

Benchmarking the code

All the tools required for benchmarking are available in the benchmark directory.

To run a benchmark, run python3 benchmark/benchmark.py.

Usage instructions on benchmark settings can be obtained by running python3 benchmark/benchmark.py -h.

Our test location database has around 1.5 million locations. For this test, we queried around 10,000 random locations. The results are as follows; the time is for a single request.
