Data Science & Machine Learning

Data Science & Machine Learning

Public

Can't Join? @datasciencefun

0

75.2k Members

Updated: May 26, 2026 at 3:09 AM

Data Science & Machine Learning

Join this channel to learn data science, artificial intelligence and machine learning with funny quizzes, interesting projects and amazing resources for free For collaborations: @love_data

@datasciencefun is a growing community focused on science and machine and related topics

Ranking

Global Ranking

#29965

-2360

Language Ranking

#3844

-90

Category Ranking

#364

-15

Participant Growth (Last 25 Days)

Total: 75.2K

24h growth: +0 0%

Loading posts...

Rating

Login required

Loading reviews...

Loading recommended channels...

Latest Posts

Data Science & Machine Learning

May 17, 2026, 04:15 PM

[poll]

2,120

2

0

Data Science & Machine Learning

May 17, 2026, 04:15 PM

[poll]

2,270

3

0

Data Science & Machine Learning

May 17, 2026, 04:15 PM

[poll]

1,820

2

0

Data Science & Machine Learning

May 17, 2026, 04:15 PM

[poll]

1,900

3

0

Data Science & Machine Learning

May 17, 2026, 04:15 PM

📷 Photo

𝗔𝗜 𝗮𝗻𝗱 𝗠𝗟 𝗣𝗿𝗼𝗴𝗿𝗮𝗺 𝗯𝘆 𝗖𝗖𝗘, 𝗜𝗜𝗧 𝗠𝗮𝗻𝗱𝗶😍

Freshers get 15 LPA Average Salary with AI & ML Skills!

💻 100% Online
⏳ 6 Months Duration
👨‍🏫 Learn from IIT Professors
📌 Open for Students ,Freshers & Working Professionals

💼 Placement Assistance with 5000+ Companies
📈 High Demand Skills for Future Tech Jobs

Top companies are hiring for candidates with 𝗔𝗜, 𝗠𝗮𝗰𝗵𝗶𝗻𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 skills in 2026

🔥Deadline :- 17th May

𝗔𝗽𝗽𝗹𝘆 𝗡𝗼𝘄👇 :-

https://pdlink.in/4nmI024
.
Get Placement Assistance With 5000+ Companies

1,640

5

Data Science & Machine Learning

May 17, 2026, 04:15 PM

[poll]

1,800

1

0

Data Science & Machine Learning

May 17, 2026, 04:15 PM

Some useful PYTHON libraries for data science

NumPy stands for Numerical Python. The most powerful feature of NumPy is n-dimensional array. This library also contains basic linear algebra functions, Fourier transforms,  advanced random number capabilities and tools for integration with other low level languages like Fortran, C and C++

SciPy stands for Scientific Python. SciPy is built on NumPy. It is one of the most useful library for variety of high level science and engineering modules like discrete Fourier transform, Linear Algebra, Optimization and Sparse matrices.

Matplotlib for plotting vast variety of graphs, starting from histograms to line plots to heat plots.. You can use Pylab feature in ipython notebook (ipython notebook –pylab = inline) to use these plotting features inline. If you ignore the inline option, then pylab converts ipython environment to an environment, very similar to Matlab. You can also use Latex commands to add math to your plot.

Pandas for structured data operations and manipulations. It is extensively used for data munging and preparation. Pandas were added relatively recently to Python and have been instrumental in boosting Python’s usage in data scientist community.

Scikit Learn for machine learning. Built on NumPy, SciPy and matplotlib, this library contains a lot of efficient tools for machine learning and statistical modeling including classification, regression, clustering and dimensionality reduction.

Statsmodels for statistical modeling. Statsmodels is a Python module that allows users to explore data, estimate statistical models, and perform statistical tests. An extensive list of descriptive statistics, statistical tests, plotting functions, and result statistics are available for different types of data and each estimator.

Seaborn for statistical data visualization. Seaborn is a library for making attractive and informative statistical graphics in Python. It is based on matplotlib. Seaborn aims to make visualization a central part of exploring and understanding data.

Bokeh for creating interactive plots, dashboards and data applications on modern web-browsers. It empowers the user to generate elegant and concise graphics in the style of D3.js. Moreover, it has the capability of high-performance interactivity over very large or streaming datasets.

Blaze for extending the capability of Numpy and Pandas to distributed and streaming datasets. It can be used to access data from a multitude of sources including Bcolz, MongoDB, SQLAlchemy, Apache Spark, PyTables, etc. Together with Bokeh, Blaze can act as a very powerful tool for creating effective visualizations and dashboards on huge chunks of data.

Scrapy for web crawling. It is a very useful framework for getting specific patterns of data. It has the capability to start at a website home url and then dig through web-pages within the website to gather information.

SymPy for symbolic computation. It has wide-ranging capabilities from basic symbolic arithmetic to calculus, algebra, discrete mathematics and quantum physics. Another useful feature is the capability of formatting the result of the computations as LaTeX code.

Requests for accessing the web. It works similar to the the standard python library urllib2 but is much easier to code. You will find subtle differences with urllib2 but for beginners, Requests might be more convenient.

Additional libraries, you might need:

os for Operating system and file operations

networkx and igraph for graph based data manipulations

regular expressions for finding patterns in text data

BeautifulSoup for scrapping web. It is inferior to Scrapy as it will extract information from just a single webpage in a run.

Data Science & Machine Learning

May 17, 2026, 04:15 PM

📷 Photo

🗄️ 𝗧𝗼𝗽 𝟱 𝗙𝗥𝗘𝗘 𝗦𝗤𝗟 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻 𝗖𝗼𝘂𝗿𝘀𝗲𝘀 🚀

SQL is one of the most important skills for Data Analyst & Tech jobs in 2026 🔥
These FREE certification courses can help you learn SQL from scratch & boost your resume 💼

✨ Learn:
✔ SQL Queries & Databases 🗄️
✔ Data Analysis Basics 📊
✔ Real-world Projects
✔ Beginner to Advanced Concepts

𝗘𝗻𝗿𝗼𝗹𝗹 𝗙𝗼𝗿 𝗙𝗥𝗘𝗘👇:-

https://pdlink.in/4dCHiKI

💯 Beginner Friendly + FREE Certificates 🎓
💼 Perfect for Students, Freshers & Career Switchers

1,830

6

Data Science & Machine Learning

May 17, 2026, 04:15 PM

✅ K-Nearest Neighbors (KNN) Basics📍🤖

KNN is a simple and powerful algorithm that makes predictions based on similar nearby data points.

🔹 1. What is KNN?
KNN = K-Nearest Neighbors
• It classifies a new data point based on the nearest neighbors around it.

🔥 2. How KNN Works
Step-by-step:
1. Choose value of K
2. Find nearest data points
3. Count categories of neighbors
4. Majority category becomes prediction

🔹 3. Example
Predict if a fruit is Apple or Orange 🍎🍊
• If most nearby fruits are Apples → Prediction = Apple.

🔹 4. What is K?
K = Number of nearest neighbors.

Example:
• K = 3 → Check nearest 3 neighbors
• K = 5 → Check nearest 5 neighbors

🔹 5. Distance Measurement ⭐
KNN uses distance to find nearest points.

Most common: Euclidean Distance

d = sqrt((x2 - x1)² + (y2 - y1)²)

Where:
• d = distance between two points
• x1, y1 = coordinates of first point
• x2, y2 = coordinates of second point

Example:
Point A = (1, 2) and Point B = (4, 6)
d = sqrt((4 - 1)² + (6 - 2)²) = sqrt(3² + 4²) = sqrt(9 + 16) = sqrt(25) = 5

🔹 6. Implementation (Python)

from sklearn.neighbors import KNeighborsClassifier

# Sample data
X = [[1], [2], [3], [4]]
y = [0, 0, 1, 1]

model = KNeighborsClassifier(n_neighbors=3)
model.fit(X, y)

print(model.predict([[2.5]]))


🔹 7. Advantages ⭐
• Easy to understand
• No training phase
• Works well for small datasets

🔹 8. Disadvantages
• Slow for large datasets
• Sensitive to irrelevant features
• Needs feature scaling

🔹 9. Why KNN is Important?
• Beginner-friendly ML algorithm
• Used in recommendation systems
• Important interview topic

🎯 Today’s Goal
• Understand nearest neighbors
• Learn value of K
• Understand distance concept

KNN = Prediction based on similarity 📍🔥

💬 Tap ❤️ for more!

Data Science & Machine Learning

May 17, 2026, 04:15 PM

📷 Photo

Want to start your career in 𝗔𝗜 & 𝗗𝗮𝘁𝗮 𝗦𝗰𝗶𝗲𝗻𝗰𝗲😍?

Learn from IIIT Bangalore & upGrad

💫 Beginner Friendly
💫 Industry Recognized Certificate
💫High Demand Career Skills

𝗕𝗼𝗼𝗸 𝗙𝗥𝗘𝗘 𝗖𝗼𝘂𝗻𝘀𝗲𝗹𝗹𝗶𝗻𝗴👇Now & explore your career roadmap

https://pdlink.in/4twH9xg

🎓Top roles you can target:
* Data Analyst , AI Engineer ,Machine Learning Engineer & Data Scientist

1,800

4

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

📷 Photo

📢 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻 𝗔𝗹𝗲𝗿𝘁 – Data Analytics with Artificial Intelligence

Upgrade your career with AI-powered data science skills.
*Open for all. No Coding Background Required*

📊 Learn Data Analytics with Artificial Intelligence from Scratch
🤖 AI Tools & Automation
📈 Build real world Projects for job ready portfolio
🎓 E&ICT IIT Roorkee Certification Program

🔥Deadline :- 22nd March

𝗔𝗽𝗽𝗹𝘆 𝗡𝗼𝘄 👇 :-  https://pdlink.in/4tkErvS

Don't Miss This Opportunity. Get Placement Assistance With 5000+ Companies

1,970

1

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

[poll]

2,660

5

0

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

[poll]

2,930

3

0

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

[poll]

3,030

6

0

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

📷 Photo

𝗙𝗿𝗲𝘀𝗵𝗲𝗿𝘀 𝗖𝗮𝗻 𝗚𝗲𝘁 𝗮 𝟯𝟬 𝗟𝗣𝗔 𝗝𝗼𝗯 𝗢𝗳𝗳𝗲𝗿 𝘄𝗶𝘁𝗵 𝗔𝗜 & 𝗗𝗦 𝗖𝗲𝗿𝘁𝗶𝗳𝗶𝗰𝗮𝘁𝗶𝗼𝗻😍

IIT Roorkee offering AI & Data Science Certification Program

💫Learn from IIT ROORKEE Professors
✅ Students & Fresher can apply
🎓 IIT Certification Program
💼 5000+ Companies Placement Support

Deadline: 22nd March 2026

📌 𝗥𝗲𝗴𝗶𝘀𝘁𝗲𝗿 𝗡𝗼𝘄 👇 :-

https://pdlink.in/4kucM7E

Big Opportunity, Do join asap!

2,310

4

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

[poll]

2,580

3

0

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

[poll]

2,600

6

0

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

✅ NumPy Basics 🐍📊

NumPy (Numerical Python) is the most important library for numerical computing in Python.

It is widely used in:
✔ Data Science
✔ Machine Learning
✔ AI
✔ Scientific computing

🔹 1. What is NumPy?

NumPy provides a powerful data structure called NumPy Array. It is faster and more efficient than Python lists for mathematical operations.

Example:
import numpy as np

🔹 2. Creating a NumPy Array

From a List

import numpy as np
arr = np.array([1, 2, 3, 4])
print(arr)

Output:
[1 2 3 4]

🔹 3. Check Array Type

print(type(arr))

Output:

🔹 4. NumPy Array Operations

Addition:

import numpy as np
arr = np.array([1, 2, 3])
print(arr + 2)

Output:
[3 4 5]

Multiplication:
print(arr * 2)

Output:
[2 4 6]

🔹 5. NumPy Built-in Functions

arr = np.array([10, 20, 30, 40])
print(arr.sum())
print(arr.mean())
print(arr.max())
print(arr.min())

Output:
100
25.0
40
10

🔹 6. NumPy Array Shape

arr = np.array([[1, 2, 3], [4, 5, 6]])
print(arr.shape)

Output:
(2, 3)

Meaning: 2 rows and 3 columns.

🔹 7. Why NumPy is Important?

NumPy is the foundation of data science libraries:
✔ Pandas
✔ Scikit-Learn
✔ TensorFlow
✔ PyTorch

All these libraries use NumPy internally.

🎯 Today's Goal
✔ Install NumPy
✔ Create arrays
✔ Perform math operations
✔ Understand array shape

Double Tap ♥️ For More

2,610

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

✅ Python Exception Handling (try–except) 🐍⚠️

Exception handling helps programs handle errors gracefully instead of crashing.

👉 Very important in real-world applications and data processing.

🔹 1. What is an Exception?

An exception is an error that occurs during program execution.

Example:
print(10 / 0)
Output: ZeroDivisionError

This will crash the program.

🔹 2. Using try–except

We use try–except to handle errors.

Syntax:
try:
# code that may cause error
except:
# code to handle error
Example:
try:
x = 10 / 0
except:
print("Error occurred")
Output: Error occurred

🔹 3. Handling Specific Exceptions

try:
num = int("abc")
except ValueError:
print("Invalid number")
✔ Handles only ValueError.

🔹 4. Using else

else runs if no error occurs.

try:
x = 10 / 2
except:
print("Error")
else:
print("No error")
Output: No error

🔹 5. Using finally

finally always executes.

try:
file = open("data.txt")
except:
print("File not found")
finally:
print("Execution completed")

🔹 6. Common Python Exceptions

• ZeroDivisionError: Division by zero
• ValueError: Invalid value
• TypeError: Wrong data type
• FileNotFoundError: File does not exist

🎯 Today's Goal

✔ Understand exceptions
✔ Use try–except
✔ Handle specific errors
✔ Use else and finally

👉 Exception handling is widely used in data pipelines and production code.

Double Tap ♥️ For More

2,940

Data Science & Machine Learning

Mar 28, 2026, 02:55 AM

SQL, or Structured Query Language, is a domain-specific language used to manage and manipulate relational databases. Here's a brief A-Z overview by https://t.me/sqlanalyst

A - Aggregate Functions: Functions like COUNT, SUM, AVG, MIN, and MAX used to perform operations on data in a database.

B - BETWEEN: A SQL operator used to filter results within a specific range.

C - CREATE TABLE: SQL statement for creating a new table in a database.

D - DELETE: SQL statement used to delete records from a table.

E - EXISTS: SQL operator used in a subquery to test if a specified condition exists.

F - FOREIGN KEY: A field in a database table that is a primary key in another table, establishing a link between the two tables.

G - GROUP BY: SQL clause used to group rows that have the same values in specified columns.

H - HAVING: SQL clause used in combination with GROUP BY to filter the results.

I - INNER JOIN: SQL clause used to combine rows from two or more tables based on a related column between them.

J - JOIN: Combines rows from two or more tables based on a related column.

K - KEY: A field or set of fields in a database table that uniquely identifies each record.

L - LIKE: SQL operator used in a WHERE clause to search for a specified pattern in a column.

M - MODIFY: SQL command used to modify an existing database table.

N - NULL: Represents missing or undefined data in a database.

O - ORDER BY: SQL clause used to sort the result set in ascending or descending order.

P - PRIMARY KEY: A field in a table that uniquely identifies each record in that table.

Q - QUERY: A request for data from a database using SQL.

R - ROLLBACK: SQL command used to undo transactions that have not been saved to the database.

S - SELECT: SQL statement used to query the database and retrieve data.

T - TRUNCATE: SQL command used to delete all records from a table without logging individual row deletions.

U - UPDATE: SQL statement used to modify the existing records in a table.

V - VIEW: A virtual table based on the result of a SELECT query.

W - WHERE: SQL clause used to filter the results of a query based on a specified condition.

X - (E)XISTS: Used in conjunction with SELECT to test the existence of rows returned by a subquery.

Z - ZERO: Represents the absence of a value in numeric fields or the initial state of boolean fields.

Data Science & Machine Learning

Mar 25, 2026, 09:34 PM

📷 Photo

Top Programming Languages for Beginners 👆

3,120

5

Data Science & Machine Learning

Mar 25, 2026, 09:34 PM

[poll]

2,940

1

0

Data Science & Machine Learning

Mar 25, 2026, 09:34 PM

[poll]

3,140

4

0

Data Science & Machine Learning

Mar 24, 2026, 05:17 AM

[poll]

2,430

2

0

Data Science & Machine Learning

Mar 24, 2026, 05:17 AM

[poll]

2,390

2

0

Data Science & Machine Learning

Mar 24, 2026, 05:17 AM

[poll]

2,630

1

0

Data Science & Machine Learning

Mar 22, 2026, 09:51 PM

Data Science Roadmap

✅ Python File Handling

🐍📂 File handling allows Python programs to read and write data from files.

👉 Very important in data science because most datasets come as:
✔ CSV files
✔ Text files
✔ Logs
✔ JSON files

🔹 1. Opening a File
Python uses the open() function.
Syntax: open("filename", "mode")
Example: file = open("data.txt", "r")
👉 "r" → Read mode

🔹 2. File Modes
- "r" → Read file
- "w" → Write file (overwrites existing content)
- "a" → Append file (adds to existing content)
- "r+" → Read and write

🔹 3. Reading a File
- Read Entire File: file.read()
- Read One Line: file.readline()
- Read All Lines: file.readlines()

🔹 4. Writing to a File
file = open("data.txt", "w")
file.write("Hello Data Science")
file.close()

⚠ "w" will overwrite existing content.

🔹 5. Append to File
file = open("data.txt", "a")
file.write("\nNew line added")
file.close()

✔ Adds content without deleting old data.

🔹 6. Best Practice (Very Important ⭐)
Use with statement.
with open("data.txt", "r") as file:
content = file.read()
print(content)

✔ Automatically closes the file.

🔹 7. Why File Handling is Important?
Used for:
✔ Reading datasets
✔ Saving results
✔ Logging machine learning models
✔ Data preprocessing

🎯 Today’s Goal
✔ Understand file modes
✔ Read files
✔ Write files
✔ Use with open()

👉 File handling is used heavily when working with CSV datasets in data science.

Double Tap ♥️ For More

2,430

Data Science & Machine Learning

Mar 22, 2026, 09:51 PM

[poll]

2,750

7

0

Data Science & Machine Learning

Mar 21, 2026, 08:57 PM

📷 Photo

🤖 𝗔𝗜 + 𝗗𝗮𝘁𝗮 = 𝗧𝗵𝗲 𝗙𝘂𝘁𝘂𝗿𝗲 𝗼𝗳 𝗝𝗼𝗯𝘀

Start your journey in Data Analytics & Data Science with AI Certification and gain skills companies are actively hiring for.

📊 Data Analysis
🐍 Python Programming
🤖 Machine Learning
📈 AI-Driven Insights

🔥 Perfect for College Students ,Freshers & Professionals

1️⃣𝗣𝘆𝘁𝗵𝗼𝗻 :- https://pdlink.in/3OD9jI1

2️⃣𝗗𝗮𝘁𝗮 𝗦𝗰𝗶𝗲𝗻𝗰𝗲 :- https://pdlink.in/4kucM7E

3️⃣𝗗𝗮𝘁𝗮 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 :- https://pdlink.in/4ay4wPG

4️⃣𝗕𝘂𝘀𝗶𝗻𝗲𝘀𝘀 𝗔𝗻𝗮𝗹𝘆𝘁𝗶𝗰𝘀 :- https://pdlink.in/3ZtIZm9

5️⃣𝗔𝗜 & 𝗠𝗮𝗰𝗵𝗶𝗻𝗲 𝗟𝗲𝗮𝗿𝗻𝗶𝗻𝗴 :- https://pdlink.in/4rMivIA

Don't Miss This Opportunity . Get Placement Assistance With 5000+ Companies

2,390

Data Science & Machine Learning

Mar 21, 2026, 08:57 PM

📷 Photo

🚀 𝗪𝗮𝗻𝘁 𝘁𝗼 𝗕𝗲𝗰𝗼𝗺𝗲 𝗮 𝗙𝘂𝗹𝗹 𝗦𝘁𝗮𝗰𝗸 𝗗𝗲𝘃𝗲𝗹𝗼𝗽𝗲𝗿 𝗶𝗻 𝟮𝟬𝟮𝟲?

Tech companies are hiring developers with React, JavaScript, Node.js & MongoDB skills.

This Full Stack Development Program helps you learn everything from scratch with real projects.

💡 Perfect for:
* Beginners
* Students
* Career switchers

𝗥𝗲𝗴𝗶𝘀𝘁𝗲𝗿 𝗡𝗼𝘄 👇:-

https://pdlink.in/4hO7rWY

⚡ Don’t miss this chance to enter the high-paying tech industry!

1,960

4

Showing 30 of 58 posts