Skip to content

Commit fab52ec

Browse files
Fixes some minor spelling issues and clumps loose sentences together
1 parent 309238b commit fab52ec

File tree

1 file changed

+3
-11
lines changed

1 file changed

+3
-11
lines changed

content/blog/pragmatism-over-perfection.md

Lines changed: 3 additions & 11 deletions
Original file line numberDiff line numberDiff line change
@@ -8,19 +8,15 @@ weight: 1
88
---
99

1010
We as Engineers often chase perfection.
11-
1211
It fuels our curiosity, sharpens our skills, and makes us feel good about the things we build. But at the same time, it can be a double-edged sword, sometimes slowing us down or distracting us from what actually matters: **delivering business value**.
13-
1412
That’s where pragmatism comes in.
1513

1614
This is a story about me getting that lesson reinforced with a simple task I was working on: KYC Image Tagging.
1715

1816
## The Problem
1917

2018
I had worked with the ML team to develop two models for tagging KYC signature images as **valid** or **invalid** — one based on a CNN, the other on a Decision Tree.
21-
2219
Once trained, we ran both models on a dataset of around **1.2 lakh images**. They disagreed on about **17%** of them — and the only way to figure out which model was better was to manually tag those images and compare.
23-
2420
The ops team was ready to help. All we needed now was a simple interface for them to actually do the tagging.
2521

2622
So I got to work exploring image tagging tools.
@@ -64,19 +60,18 @@ It ticked all our boxes:
6460

6561
My manager signed off, and we presented the options to the stakeholders. They were happy that we could use an existing system to get the job done.
6662
So, we used Databricks as a tagging tool.
67-
6863
It was in no way the "right" tool for the job, but it worked, and that made it the best one.
6964
And now it was time to implement this pragmatic solution.
7065

71-
> What is databricks?
66+
> What is Databricks?
7267
>
73-
> Databricks is an all in one platform for analysts, and engineers to manipulate, process and use data, read more [here](https://www.databricks.com/data-intelligence?scid=7018Y000001f8FIQAY&utm_medium=paid+search&utm_source=google&utm_campaign=20782149301&utm_adgroup=152953302702&utm_content=microsite&utm_offer=data-intelligence&utm_ad=724408738477&utm_term=what%20is%20databricks&gad_source=1&gclid=Cj0KCQjw2ZfABhDBARIsAHFTxGwAa41AMcCUzaTbsL60svmAaD4LReAsmqlwm_SMoJYbKgzcDWwEoGAaAi4wEALw_wcB).
68+
> Databricks is an-all-in one platform for analysts, and engineers to manipulate, process and use data, read more [here](https://www.databricks.com/data-intelligence?scid=7018Y000001f8FIQAY&utm_medium=paid+search&utm_source=google&utm_campaign=20782149301&utm_adgroup=152953302702&utm_content=microsite&utm_offer=data-intelligence&utm_ad=724408738477&utm_term=what%20is%20databricks&gad_source=1&gclid=Cj0KCQjw2ZfABhDBARIsAHFTxGwAa41AMcCUzaTbsL60svmAaD4LReAsmqlwm_SMoJYbKgzcDWwEoGAaAi4wEALw_wcB).
7469
7570
## Getting it up and Running
7671

7772
I onboarded the Ops team onto Databricks and assigned them the right roles, which was quick and easy since I already had admin access.
78-
7973
Then, I created a simple notebook for them.
74+
8075
Here is what the notebook did:
8176

8277
1. Fetched an image path at random from the table we had where the manual tag field was Null.
@@ -94,13 +89,10 @@ And after a short walkthrough, the Ops team was off tagging the images.
9489
## The Outcome
9590

9691
Using existing tools enabled us to get started quickly.
97-
9892
It did introduce a few extra steps for the ops team, like running the notebook cells repeatedly, as compared to the right tools. However, the overhead was minimal, within acceptable limits, and the Ops team found the process simple.
9993

10094
The tagging was completed within **3 weeks**, which gave us the clarity we needed on the model performances.
101-
10295
If I had used the “right” tools, the setup alone would have taken about a week.
103-
10496
In the end, this approach saved us **time, effort, complexity and additional costs** while keeping the stakeholders happy.
10597

10698
P.S. The CNN model won.

0 commit comments

Comments
 (0)