Clever Challenge - Paul-Andre Henegar by Paul-Andre · Pull Request #4 · MathieuNls/clever-challenge

Paul-Andre · 2018-04-02T18:14:21Z

Api change

I changed the result struct slightly. In the functionCalls member, I count the number of function calls before and after the diff separately, like functionCalls map[string]struct{ before, after int }.

Approach

My approach for the first 4 parts is relatively straightforward, with the only slightly interesting thing being that I used state functions (inspired by https://talks.golang.org/2011/lex.slide#1) and closures.

My approach for counting function calls was reading in the regions one at a time, tokenizing them and deciding if sequences of tokens are considered to be function calls. I read each region into two buffers, one for the before version, one for the after version, and counted function calls separately.

My "tokenizer" is really basic, and only distinguishes what looks like identifiers from whitespace and other characters. It does not treat comments or strings correctly, so my code would currently count a function call inside a comment. The tokenizer could be extended or replaced by an actual tokenizer without any problem. I tried using a regexp based tokenizer at first and it was extremely slow.

The function calls are counted differently depending on the language and I don't count them if the language is unknown. Right now, I only implemented basic function call counting for C (.c, .h) and Python (.py). For both languages, I keep a window of 3 tokens and check if the second token is an identifier, the third token is a '(', and the first token is not something that could indicate a function definition.

Speed

On my computer, the solution runs in about 310ms when I print the info and 120ms when I don't.

And the values of the map are struct{before, after int}

Tokenizer now properly returns identifiers after whitespace Now correctly considers the file extension for both source and destination file Now consider /dev/null files

Paul-Andre added 13 commits April 1, 2018 01:23

Done the first four parts

eb46baa

Added C function call counter, with horrible performance

0c5fc70

Removed regexps, now performance isn't as horrible

1d648ab

Duplicated the C function call counter function and adapted for python

d51e077

Factored ignoring whitespace into the tokenizer

7dd90f5

now using state functions for reading the diff files

26a4e1f

Now result only has a single functionCalls map

1d7257e

And the values of the map are struct{before, after int}

Fixed some bugs

8213243

Tokenizer now properly returns identifiers after whitespace Now correctly considers the file extension for both source and destination file Now consider /dev/null files

Move state functions outside the loop and do keyword checks for python

be0dbc5

Put everything related to counting function calls into a different file

c437465

Made some things slightly more clear

743bf34

Fixed processing last region; some comments

2a6677b

Removed extension list from result

e1024b6

MathieuNls mentioned this pull request Sep 19, 2018

Python Solution #7

Open

MathieuNls mentioned this pull request Sep 26, 2018

Assignment clever-challenge. #8

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clever Challenge - Paul-Andre Henegar#4

Clever Challenge - Paul-Andre Henegar#4
Paul-Andre wants to merge 13 commits intoMathieuNls:masterfrom
Paul-Andre:master

Paul-Andre commented Apr 2, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Paul-Andre commented Apr 2, 2018

Api change

Approach

Speed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant