Uncodedtech
diff --git a/‎1-Introduction/01-defining-data-science/notebook.ipynb‎
Lines changed: 2 additions & 2 deletions b/‎1-Introduction/01-defining-data-science/notebook.ipynb‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎1-Introduction/01-defining-data-science/translations/README.es.md‎
Lines changed: 171 additions & 0 deletions b/‎1-Introduction/01-defining-data-science/translations/README.es.md‎
Lines changed: 171 additions & 0 deletions
diff --git a/‎1-Introduction/01-defining-data-science/translations/assignment.es.md‎
Lines changed: 32 additions & 0 deletions b/‎1-Introduction/01-defining-data-science/translations/assignment.es.md‎
Lines changed: 32 additions & 0 deletions
diff --git a/‎1-Introduction/03-defining-data/README.md‎
Lines changed: 2 additions & 2 deletions b/‎1-Introduction/03-defining-data/README.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎1-Introduction/04-stats-and-probability/README.md‎
Lines changed: 3 additions & 3 deletions b/‎1-Introduction/04-stats-and-probability/README.md‎
Lines changed: 3 additions & 3 deletions
diff --git a/‎1-Introduction/translations/README.es.md‎
Lines changed: 1 addition & 1 deletion b/‎1-Introduction/translations/README.es.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎1-Introduction/translations/README.fa.md‎
Lines changed: 21 additions & 0 deletions b/‎1-Introduction/translations/README.fa.md‎
Lines changed: 21 additions & 0 deletions
diff --git a/‎1-Introduction/translations/README.nl.md‎
Lines changed: 17 additions & 0 deletions b/‎1-Introduction/translations/README.nl.md‎
Lines changed: 17 additions & 0 deletions
diff --git a/‎2-Working-With-Data/06-non-relational/README.md‎
Lines changed: 1 addition & 1 deletion b/‎2-Working-With-Data/06-non-relational/README.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎2-Working-With-Data/07-python/README.md‎
Lines changed: 1 addition & 1 deletion b/‎2-Working-With-Data/07-python/README.md‎
Lines changed: 1 addition & 1 deletion
@@ -70,7 +70,7 @@
     "\r\n",
     "The next step is to convert the data into the form suitable for processing. In our case, we have downloaded HTML source code from the page, and we need to convert it into plain text.\r\n",
     "\r\n",
-    "There are many ways this can be done. We will use the simplest build-in [HTMLParser](https://docs.python.org/3/library/html.parser.html) object from Python. We need to subclass the `HTMLParser` class and define the code that will collect all text inside HTML tags, except `<script>` and `<style>` tags."
+    "There are many ways this can be done. We will use the simplest built-in [HTMLParser](https://docs.python.org/3/library/html.parser.html) object from Python. We need to subclass the `HTMLParser` class and define the code that will collect all text inside HTML tags, except `<script>` and `<style>` tags."
    ],
    "metadata": {}
   },
@@ -416,4 +416,4 @@
  },
  "nbformat": 4,
  "nbformat_minor": 2
-}
+}
@@ -0,0 +1,32 @@
+# Tarea: Escenarios de la ciencia de datos
+
+En esta primera tarea, os pedimos pensar sobre algún problema o proceso de la vida real en distintos contextos, y como se podrían solucionar o mejorar utilizando procesos de ciencia de datos. Piensa en lo siguiente:
+
+1. ¿Qué datos puedes obtener?
+1. ¿Cómo los obtendrías?
+1. ¿Cómo los almacenarías? ¿Qué tamaño es podemos esperar que tengan los datos?
+1. ¿Qué información podrías ser capaz de extraer de estos datos? ¿qué decisiones podríamos tomar basándonos en ellos?
+
+Intenta pensar en 3 diferentes problemas/procesos y describe cada uno de los puntos de arriba para el contexto de cada problema.
+
+Estos son algunos problemas o contextos que pueden ayudarte a empezar a pensar:
+
+1. ¿Cómo se pueden usar los datos para mejorar el proceso de educación de niños en los colegios?
+1. ¿Cómo podemos usar los datos para controlar la vacunación durante la pandemia?
+1. ¿Cómo se pueden usar los datos para asegurarnos de que somos productivos en nuestro trabajo?
+
+## Instrucciones
+
+Rellena la siguiente table (sustituye los problemas sugeridos por los propuestos por tí si es necesario):
+
+| Contexto del problema | Problema | Qué datos obtener | Cómo almacenar los datos | Qué información/decisiones podemos tomar | 
+|----------------|---------|-----------------------|-----------------------|--------------------------------------|
+| Educación | | | | |
+| Vacunación | | | | |
+| Productividad | | | | |
+
+## Rúbrica
+
+Ejemplar | Adecuada | Necesita mejorar
+--- | --- | -- |
+Es capaz de indentificar fuentes de datos razonables, formas de almacenarlos y posibles decisiones/información para todos los contextos | Algunos aspectos de la solución no están detallados, no se habla sobre el almacenamiento de los datos, al menos se describen dos contextos distintos | Solo se describen partes de la solución, solo se considera un contexto.
@@ -25,14 +25,14 @@ A benefit of structured data is that it can be organized in such a way that it c
 Examples of structured data: spreadsheets, relational databases, phone numbers, bank statements
 
 ### Unstructured Data
-Unstructured data typically cannot be categorized into into rows or columns and doesn't contain a format or set of rules to follow. Because unstructured data has less restrictions on its structure it's easier to add new information in comparison to a structured dataset. If a sensor capturing data on barometric pressure every 2 minutes has received an update that now allows it to measure and record temperature, it doesn't require altering the existing data if it's unstructured. However, this may make analyzing or investigating this type of data take longer. For example, a scientist who wants to find the average temperature of the previous month from the sensors data, but discovers that the sensor recorded an "e" in some of its recorded data to note that it was broken instead of a typical number, which means the data is incomplete.
+Unstructured data typically cannot be categorized into rows or columns and doesn't contain a format or set of rules to follow. Because unstructured data has less restrictions on its structure it's easier to add new information in comparison to a structured dataset. If a sensor capturing data on barometric pressure every 2 minutes has received an update that now allows it to measure and record temperature, it doesn't require altering the existing data if it's unstructured. However, this may make analyzing or investigating this type of data take longer. For example, a scientist who wants to find the average temperature of the previous month from the sensors data, but discovers that the sensor recorded an "e" in some of its recorded data to note that it was broken instead of a typical number, which means the data is incomplete.
 
 Examples of unstructured data: text files, text messages, video files
 
 ### Semi-structured
 Semi-structured data has features that make it a combination of structured and unstructured data. It doesn't typically conform to a format of rows and columns but is organized in a way that is considered structured and may follow a fixed format or set of rules. The structure will vary between sources, such as a well defined hierarchy to something more flexible that allows for easy integration of new information. Metadata are indicators that help decide how the data is organized and stored and will have various names, based on the type of data. Some common names for metadata are tags, elements, entities and attributes. For example, a typical email message will have a subject, body and a set of recipients and can be organized by whom or when it was sent. 
 
-Examples of unstructured data: HTML, CSV files, JavaScript Object Notation (JSON)
+Examples of semi-structured data: HTML, CSV files, JavaScript Object Notation (JSON)
 
 ## Sources of Data 
 
 
@@ -25,7 +25,7 @@ In the case of discrete random variables, it is easy to describe the probability
 
 The most well-known discrete distribution is **uniform distribution**, in which there is a sample space of N elements, with equal probability of 1/N for each of them. 
 
-It is more difficult to describe the probability distribution of a continuous variable, with values drawn from some interval [a,b], or the whole set of real numbers &Ropf;. Consider the case of bus arrival time. In fact, for each exact arrival time $t$, the probability of a bus arriving at exactly that time is 0!
+It is more difficult to describe the probability distribution of a continuous variable, with values drawn from some interval [a,b], or the whole set of real numbers &Ropf;. Consider the case of bus arrival time. In fact, for each exact arrival time *t*, the probability of a bus arriving at exactly that time is 0!
 
 > Now you know that events with 0 probability happen, and very often! At least each time when the bus arrives!
 
@@ -240,8 +240,8 @@ While this is definitely not exhaustive list of topics that exist within probabi
 ## 🚀 Challenge
 
 Use the sample code in the notebook to test other hypothesis that: 
-1. First basemen and older that second basemen
-2. First basemen and taller than third basemen
+1. First basemen are older than second basemen
+2. First basemen are taller than third basemen
 3. Shortstops are taller than second basemen
 
 ## [Post-lecture quiz](https://red-water-0103e7a0f.azurestaticapps.net/quiz/7)
 
@@ -12,7 +12,7 @@ cómo se definen los datos y un poco de probabilidad y estadística, el núcleo
 1. [Definiendo la Ciencia de Datos](../01-defining-data-science/README.md)
 2. [Ética de la Ciencia de Datos](../02-ethics/README.md)
 3. [Definición de Datos](../03-defining-data/translations/README.es.md)
-4. [introducción a la probabilidad y estadística](../04-stats-and-probability/README.md)
+4. [Introducción a la probabilidad y estadística](../04-stats-and-probability/README.md)
 
 ### Créditos
 
 
@@ -0,0 +1,21 @@
+<div dir="rtl">
+  
+# مقدمه‌ای بر علم داده
+
+
+![data in action](../images/data.jpg)
+> تصویر از <a href="https://unsplash.com/@dawson2406?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Stephen Dawson</a> در <a href="https://unsplash.com/s/photos/data?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Unsplash</a>
+
+شما در این بخش با تعریف علم داده و ملاحظات اخلاقی که یک دانشمند علوم داده باید در نظر داشته باشد آشنا خواهید شد. همچنین با تعریف داده و کمی هم با آمار و احتمالات که پایه و اساس علم داده است آشنا خواهید شد. 
+
+### سرفصل ها
+
+1. [تعریف علم داده](../01-defining-data-science/README.md)
+2. [اصول اخلاقی علم داده](../02-ethics/README.md)
+3. [تعریف داده](../03-defining-data/README.md)
+4. [مقدمه ای بر آمار و احتمال](../04-stats-and-probability/README.md)
+
+### تهیه کنندگان
+
+این درس ها با ❤️ توسط [Nitya Narasimhan](https://twitter.com/nitya) و [Dmitry Soshnikov](https://twitter.com/shwars) تهیه شده است.
+</div>
@@ -0,0 +1,17 @@
+# Inleiding tot datawetenschap
+
+![data in actie](images/data.jpg)
+> Beeld door <a href="https://unsplash.com/@dawson2406?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Stephen Dawson</a> op <a href="https://unsplash.com/s/photos/data?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Unsplash</a>
+  
+In deze lessen ontdek je hoe Data Science wordt gedefinieerd en leer je over ethische overwegingen waarmee een datawetenschapper rekening moet houden. Je leert ook hoe gegevens worden gedefinieerd en leert over statistiek en waarschijnlijkheid, de academische kerndomeinen van Data Science.
+
+### Onderwerpen
+
+1. [Data Science definiëren](01-defining-data-science/README.md)
+2. [Ethiek in Data Science](02-ethics/README.md)
+3. [Data definiëren](03-defining-data/README.md)
+4. [Inleiding tot statistiek en kansrekening](04-stats-and-probability/README.md)
+
+### Credits
+
+Dit lesmateriaal is met liefde ❤️ geschreven door [Nitya Narasimhan](https://twitter.com/nitya) en [Dmitry Soshnikov](https://twitter.com/shwars).
@@ -49,7 +49,7 @@ NoSQL is an umbrella term for the different ways to store non-relational data an
 
 ![Graphical representation of a columnar data store showing a customer database with two column families named Identity and Contact Info](images/columnar-db.png)
 
-[Columnar](https://docs.microsoft.com/en-us/azure/architecture/data-guide/big-data/non-relational-data#columnar-data-stores) data stores organizes data into columns and rows like a relational data structure but each column is divided into groups called a column family, where the all the data under one column is related and can be retrieved and changed in one unit. 
+[Columnar](https://docs.microsoft.com/en-us/azure/architecture/data-guide/big-data/non-relational-data#columnar-data-stores) data stores organizes data into columns and rows like a relational data structure but each column is divided into groups called a column family, where all the data under one column is related and can be retrieved and changed in one unit. 
 
 
 ### Document Data Stores with the Azure Cosmos DB 
 
@@ -52,7 +52,7 @@ Pandas is centered around a few basic concepts.
 
 ### Series 
 
-**Series** is a sequence of values, similar to a list or numpy array. The main difference is that series also has and **index**, and when we operate on series (eg., add them), the index is taken into account. Index can be as simple as integer row number (it is the index used by default when creating a series from list or array), or it can have a complex structure, such as date interval.
+**Series** is a sequence of values, similar to a list or numpy array. The main difference is that series also has an **index**, and when we operate on series (eg., add them), the index is taken into account. Index can be as simple as integer row number (it is the index used by default when creating a series from list or array), or it can have a complex structure, such as date interval.
 
 > **Note**: There is some introductory Pandas code in the accompanying notebook [`notebook.ipynb`](notebook.ipynb). We only outline some the examples here, and you are definitely welcome to check out the full notebook.