
Commit 59b67f3

linking for 5 also done - section 6 left only
1 parent 25d1c28 commit 59b67f3

1 file changed

+17
-12
lines changed

Learn.md

Lines changed: 17 additions & 12 deletions
@@ -305,19 +305,20 @@ All the null values were handled for all three data sets and ensured the dataset
305305
| YearsCodePro | 18112 | 0 |
306306
| JobSeek | 2153 | 0 |
307307

308-
# <a name="5 Data Analysis and Visualization">Data Analysis and Visualization</a>
308+
<h1 id="5 Data Analysis and Visualization">Data Analysis and Visualization</h1>
309309

310310
After cleaning and handling outliers in all three datasets, we started looking for valuable insights that we could draw from them.
311311

312312
<img src="https://recodehive.com/wp-content/uploads/2021/05/Message-from-Founder-1024x576.jpg">
313313

314-
## <a name="5.1 Distribution of respondents based on country">5.1) Distribution of respondents based on country</a>
314+
<h2 id="5.1 Distribution of respondents based on country">5.1) Distribution of respondents based on country</h2>
315315

316316
We used `plotly` to create a geoplot showing where the respondents are from and how they are distributed around the world. Most of the respondents are from the United States, and India is in second position in terms of the number of respondents.
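
The counting step behind such a map can be sketched as follows. This is a minimal sketch, not the code used for the plot: the function names `respondents_per_country` and `plot_country_map` are hypothetical, the sample list is made up, and the counting uses only the standard library so it runs without extra dependencies. The plotting helper assumes `plotly` is installed and uses `plotly.express.choropleth` with country names as locations.

```python
from collections import Counter


def respondents_per_country(countries):
    """Count respondents per country, most common first."""
    # Skip missing answers (None or empty string) before counting.
    counts = Counter(c for c in countries if c)
    return counts.most_common()


def plot_country_map(country_counts):
    """Draw a choropleth of respondent counts (requires plotly)."""
    # Imported lazily so the counting helper works without plotly installed.
    import plotly.express as px

    names = [name for name, _ in country_counts]
    totals = [total for _, total in country_counts]
    return px.choropleth(
        locations=names,
        locationmode="country names",  # match on country names, not ISO codes
        color=totals,
    )


# Hypothetical sample, not survey data.
sample = ["United States", "India", "United States", None,
          "Germany", "India", "United States"]
country_counts = respondents_per_country(sample)
print(country_counts)
```

Calling `plot_country_map(country_counts).show()` would then render the map in a notebook or browser.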
317317

318318
<img src="Data/Images/Geo plot.png">
319319

320-
## <a name="5.2 Impact of participation rate due to different ethnicity">5.2) Impact of participation rate due to different ethnicity</a>
320+
321+
<h2 id="5.2 Impact of participation rate due to different ethnicity">5.2) Impact of participation rate due to different ethnicity</h2>
321322

322323
Consistently across all three years, we found that `white or european descent` has the highest participation rate overall.
323324

@@ -336,29 +337,29 @@ for i, v in enumerate(count):
336337

337338
<img src="Data/Images/Ethnicity vs participation.png">
338339

339-
## <a name="5.3 Most popular programming language in three years">5.3) Most popular programming language in three years</a>
340+
<h2 id="5.3 Most popular programming language in three years">5.3) Most popular programming language in three years</h2>
340341

341342
The most popular language that developers worked with between 2018 and 2020 is JavaScript (14%). The second and third most used languages are HTML/CSS (13%) and SQL (11%). JavaScript and SQL both showed a steady increase over the three years. The percentage of HTML/CSS increased slightly from 2018 to 2019, then dropped back to its 2018 level in 2020. Python accounted for about 9% in 2018, decreased to 8% in 2019, and rose by 1 percentage point in 2020.
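
One way percentages like these could be computed is sketched below. It assumes that each respondent's languages are stored in a single semicolon-separated field and that the percentages are shares of all language mentions rather than shares of respondents. The text does not state either point, so both are assumptions, and the function name `language_shares` and the sample answers are hypothetical.

```python
from collections import Counter


def language_shares(responses):
    """Return each language's share of all language mentions, in percent."""
    mentions = Counter()
    for answer in responses:
        if not answer:
            continue  # respondent skipped the question
        # One respondent can list several languages in a single field.
        mentions.update(lang.strip() for lang in answer.split(";"))
    total = sum(mentions.values())
    if total == 0:
        return {}
    return {lang: round(100 * n / total, 1) for lang, n in mentions.most_common()}


# Hypothetical answers, not survey data.
sample = ["JavaScript;HTML/CSS;SQL", "Python;SQL", "JavaScript;HTML/CSS",
          None, "JavaScript"]
shares = language_shares(sample)
print(shares)
```

Running the same function on each year's responses separately would give the per-year figures needed to compare trends.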
342343

343344
Some languages appeared only in 2019: Elixir, Clojure, F#, WebAssembly, and Erlang. Perl, Haskell, and Julia appeared in 2019 and 2020 with small percentages.
344345

345346
<img src="Data/Images/popular language distribution.png">
346347

347-
## <a name="5.4 Distribution of developers based on their developer role">5.4) Distribution of developers based on their developer role</a>
348+
349+
<h2 id="5.4 Distribution of developers based on their developer role">5.4) Distribution of developers based on their developer role</h2>
348350

349351
Most of the respondents were either back-end or full-stack developers. Marketing and sales professionals make up the lowest percentage compared to the other roles.
350352

351353
<img src="Data/Images/devtype distribution.png">
352354

353355

354-
355-
## <a name="5.5 Distribution of respondents based on age">5.5) Distribution of respondents based on age</a>
356+
<h2 id="5.5 Distribution of respondents based on age">5.5) Distribution of respondents based on age</h2>
356357

357358
Most of the respondents are in the age range 25-29. This suggests that most of the respondents recently joined their companies or have less than 5 years of experience.
358359

359360
<img src="Data/Images/age distribution.png">
360361

361-
## <a name="5.6 Salary distribution of top ten countries">5.6) Salary distribution of top ten countries</a>
362+
<h2 id="5.6 Salary distribution of top ten countries">5.6) Salary distribution of top ten countries</h2>
362363

363364
Overall, the country with the highest mean annual salary is the United States of America ($240,000). The country with the second-highest mean salary is Australia ($164,926). Though India has a high number of respondents, it has the lowest mean salary, $25,213, which suggests that the mean salary in developed countries is much higher than in developing countries.
364365

@@ -384,29 +385,33 @@ plt.show()
384385

385386
<img src="Data/Images/salary top ten countries.png">
386387

387-
## <a name="5.7 Analysis of impact of education on salary">5.7) Analysis of impact of education on salary</a>
388+
389+
<h2 id="5.7 Analysis of impact of education on salary">5.7) Analysis of impact of education on salary</h2>
388390

389391
Respondents with a doctorate have the highest mean salary among all education levels. Respondents with a bachelor's degree have a higher mean salary than master's degree holders. This may be due to years of professional coding experience and to the higher number of respondents in that category (35,659 respondents hold a bachelor's degree and 16,940 hold a master's degree).
390392

391393
Interestingly, respondents who do not have any degree have a mean salary of $90k. This may reflect improvements in online learning and advances in technology that are shifting companies away from relying on university degrees.
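
Because the group sizes differ so much, it can help to report the respondent count next to each mean. The following is a minimal sketch under stated assumptions: the input is a list of hypothetical `(education level, salary)` pairs, the function name `salary_by_education` is made up for illustration, and the sample values are not survey data.

```python
from collections import defaultdict
from statistics import mean


def salary_by_education(rows):
    """Map education level -> (respondent count, mean salary)."""
    groups = defaultdict(list)
    for ed_level, salary in rows:
        # Rows with a missing level or salary cannot contribute to a mean.
        if ed_level is None or salary is None:
            continue
        groups[ed_level].append(salary)
    return {level: (len(vals), mean(vals)) for level, vals in groups.items()}


# Hypothetical rows, not survey data.
sample = [
    ("Bachelor", 90000),
    ("Bachelor", 110000),
    ("Master", 95000),
    ("Master", None),
    ("Doctorate", 130000),
]
summary = salary_by_education(sample)
for level, (count, avg) in summary.items():
    print(f"{level}: n={count}, mean={avg}")
```

Showing the count alongside the mean makes it easier to judge how much weight a difference between two groups can bear.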
392394

393395
<img src="Data/Images/salary on edlevel.png">
394396

395-
## <a name="5.8 Gender distribution among top five countries in 2019">5.8) Gender distribution among top five countries in 2019</a>
397+
398+
<h2 id="5.8 Gender distribution among top five countries in 2019">5.8) Gender distribution among top five countries in 2019</h2>
396399

397400
Based on the top 5 countries by number of respondents, we categorized male and female respondents in those countries.
398401

399402
The US has the largest female percentage at about 10.9%, followed by Canada and the UK at 9.6% and 8.0% respectively. Female respondents were around 5% in India and Germany, the lowest among the top 5 countries.
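
A calculation of this kind might look like the sketch below. It assumes each record is a `(country, gender)` pair and that female respondents are labeled `"Woman"`. The label, the function name `female_share_by_country`, and the sample rows are all assumptions for illustration.

```python
from collections import Counter


def female_share_by_country(rows, top_n=5, female_label="Woman"):
    """Percent of female respondents in the top_n countries by respondent count."""
    totals = Counter(country for country, _ in rows)
    women = Counter(country for country, gender in rows if gender == female_label)
    # Counter returns 0 for countries with no female respondents.
    return {
        country: round(100 * women[country] / total, 1)
        for country, total in totals.most_common(top_n)
    }


# Hypothetical rows, not survey data.
sample = [
    ("United States", "Man"), ("United States", "Woman"),
    ("United States", "Man"), ("United States", "Man"),
    ("India", "Man"), ("India", "Man"), ("India", "Man"),
    ("Germany", "Woman"),
]
result = female_share_by_country(sample, top_n=2)
print(result)
```

Selecting the top countries by total respondents first, and only then computing the share, keeps small countries with a handful of respondents from dominating the comparison.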
400403

401404
<img src="Data/Images/gender distribution top 5.png">
402405

403-
## <a name="5.9 Where most data scientist came from in 2019?">5.9) Where most data scientist came from in 2019?</a>
406+
<h2 id="5.9 Where most data scientist came from in 2019?">5.9) Where most data scientist came from in 2019?</h2>
404407

405408
There are 5,788 data scientists who responded to the Stack Overflow survey in `2019`. Most data scientists are from the US, with 1,550 people, which is 3 times higher than the number of data scientists from India. Germany and the UK follow with 427 and 339 people respectively. The rest are Canada, France, the Netherlands, Brazil, Russia, and Australia, each with fewer than 200 data scientists.
406409

407410
<img src="Data/Images/DS_top contries.png">
408411

409-
## <a name="5.10 Countries which pays the most for data scientist in 2019">5.10) Countries which pays the most for data scientist in 2019</a>
412+
413+
<h2 id="5.10 Countries which pays the most for data scientist in 2019">5.10) Countries which pays the most for data scientist in 2019</h2>
414+
410415

411416
In 2019, the top three countries with the highest mean annual salary for a data scientist are Ireland (`$275,851`), Luxembourg (`$272,769`), and the USA (`$265,211`). Apart from these, the mean salary in the remaining countries is less than `$200,000` per year. Japan has the highest mean annual salary among Asian countries (`$118,969`).
412417
