MEMBERSHIP

Individual Members (1732)

Membership is FREE for data scientists, stay connected and build a profile

University (20)

Provide course information and participate in the peer-review publishing programme

Corporate (38)

We will assist your search for qualified and approved data scientists

Supplier (23)

Develop connections and improve your profile within the data science sector

Data Talk Articles

WHITEPAPERS

View All

LATEST DISCUSSION

Ocean Protocol - #1 Data Economy Challenge;

25 February 2020 | Sebastian Howler

Dear peopleI have recently introtuced Ocean Protocol to you all and I'm happy to say since then a Data Economy Challenge has been held. This was the first of many where participants could build on the protocol and get rewarded for it. The winners even get support to fully build out their business. Check out the submissions here and winners here:https://oceanprotocol.devpost.com/submissionsAlso there's a new forum coming up where you can contact devs and participants. https://port.oceanprotocol.com/Let me know what do you think

READ MORE

Data science courses online;

24 February 2020 | Divya Jain

Data science online certification course on Geeklurn https://www.geeklurn.com/data-science-with-python/

READ MORE

Univariate;

22 February 2020 | Abhishek Mishra

Univariate data contains only one variable. The purpose of the univariate analysis is to describe the data and find patterns that exist within it.

READ MORE

How can you avoid the overfitting of your model?;

22 February 2020 | Abhishek Mishra

Keep the model simple—take fewer variables into account, thereby removing some of the noise in the training dataUse cross-validation techniques, such as k folds cross-validation Use regularization techniques, such as LASSO, that penalize certain model parameters if they're likely to cause overfitting

READ MORE

Steps to build a random forest model:;

22 February 2020 | Abhishek Mishra

Randomly select 'k' features from a total of'm' features where k << mAmong the 'k' features, calculate the node D using the best split pointSplit the node into daughter nodes using the best splitRepeat steps two and three until leaf nodes are finalized Build forest by repeating steps one to four for 'n' times to create 'n' number of trees

READ MORE

Steps in making a decision tree;

22 February 2020 | Abhishek Mishra

Take the entire data set as inputCalculate entropy of the target variable, as well as the predictor attributesCalculate your information gain of all attributes (we gain information on sorting different objects from each other)Choose the attribute with the highest information gain as the root node Repeat the same procedure on every branch until the decision node of each branch is finalized

READ MORE

Logistic Regression;

22 February 2020 | Abhishek Mishra

Logistic regression measures the relationship between the dependent variable (our label of what we want to predict) and one or more independent variables (our features) by estimating probability using its underlying logistic function (sigmoid).

READ MORE

My recently published article on Hybrid Machine Learning;

20 February 2020 | Fatai Anifowose

Thanks for sharing. It is indeed a interesting read.

READ MORE

non-Gaussian distribution;

16 February 2020 | Abhishek Mishra

Any simpler explanation?

READ MORE

What is the Binomial Probability Formula;

16 February 2020 | Abhishek Mishra

The binomial distribution consists of the probabilities of each of the possible numbers of successes on N trials for independent events that each have a probability of π (the Greek letter pi) of occurring.

READ MORE

What is a statistical interaction?;

16 February 2020 | Abhishek Mishra

Basically, an interaction is when the effect of one factor (input variable) on the dependent variable (output variable) differs among levels of another factor.

READ MORE

What is the Central Limit Theorem and why is it important?;

16 February 2020 | Abhishek Mishra

he central limit theorem states that if you have a population with mean μ and standard deviation σ and take sufficiently large random samples from the population with replacement , then the distribution of the sample means will be approximately normally distributed..Whats your thought?

READ MORE

Regularization;

16 February 2020 | Abhishek Mishra

Regularization is a process of constraining the learning of the model to reduce overfitting.

READ MORE

How can you avoid the overfitting of your model?;

16 February 2020 | Abhishek Mishra

Cross-validation.Train with more data.Remove features. ...Early stopping. ...Regularization. ...Ensembling. What are some of the other ideas?

READ MORE

Black Box;

16 February 2020 | Abhishek Mishra

Its not magical. Black box algorithms are the complex code at the heart of systems

READ MORE
View All

LATEST NEWS

The Age of Big Data

20 February 2020

GOOD with numbers? Fascinated by data? The sound you hear is opportunity knocking. Mo Zhou was snapped

READ MORE

White House Earmarks New Money for A.I. and Quantum Computing

11 February 2020

The technologies are expected to become an important part of national security, and some worry the United States is behind China in their development.

READ MORE

5 critical issues solved by DataOps

08 February 2020

This article examines the practical uses of DataOps, its advantages, and how it can solve critical issues.

READ MORE

How Blackstone uses data scientists to win deals

05 February 2020

Investment banks and hedge funds aren't alone in incorporating data science into their business models. Private equity funds are also turning to data science, both to win deals in the first place and to help them manage portfolio companies after a purchase

READ MORE

Coronavirus: Can AI (Artificial Intelligence) Make A Difference?

02 February 2020

The mysterious coronavirus is spreading at an alarming rate. There have been at least 305 deaths as more than 14,300 persons have been infected.

READ MORE

Combining Data Science, Machine Learning and Frontline Expertise

31 January 2020

When thinking about using big data, back-end processes, such as billing, cannot be ignored. Most healthcare organizations engage in a complicated receivables process involving multiple vendors. These vendors work with healthcare organizations on boutique processes ranging from payer denials to patient collections.

READ MORE

The Growing Need for Data Scientists

26 January 2020

By 2020 there will be over 2.7 million data scientist job openings to take on this massive growth.

READ MORE

The battle for ethical AI at the world’s biggest machine-learning conference

25 January 2020

Bias and the prospect of societal harm increasingly plague artificial-intelligence research — but it’s not clear who should be on the lookout for these problems.

READ MORE

Data science in GAD helps clients

15 January 2020

GAD uses data science to identify patterns, gain detailed insights and offer better advice to clients.

READ MORE

Artificial intelligence is helping us talk to animals. Yes, really

29 December 2019

AI has helped us decode ancient languages, and now researchers are turning the same technique to help understand our pets

READ MORE

Clear standards required for development and use of AI in healthcare

21 December 2019

With digital technologies set to irrevocably change the face of our healthcare systems, the ethical concerns surrounding the use of artificial intelligence (AI) are increasingly gaining prominence in policy circles.

READ MORE

Three ways data science is unbaking the cake

14 December 2019

Here are three ways data science is already doing the impossible

READ MORE

AI DEEMED ‘TOO DANGEROUS TO RELEASE’ MAKES IT OUT INTO THE WORLD

09 November 2019

Researchers had feared that the model, known as "GPT-2", was so powerful that it could be maliciously misused by everyone from politicians to scammers.

READ MORE

Opinion: Why we should be worried about artificial intelligence on Wall Street

02 November 2019

Until recently, artificial intelligence has struggled to gain a foothold on Wall Street. No longer.

READ MORE

What Can AI and Big Data Do for Finance?

23 October 2019

AI and big data represent the future of investing. Their broad application is likely to usher in perhaps the most significant change in the history of the industry. Why? Because with AI and big data: Analysts will be able to perform more thorough analysis. Portfolio managers will make better informed decisions.

READ MORE
View All

INFORMATION ON DATA SCIENCE

The Data Science Foundation is your source of information on big data techniques and practices. We review and publish industry news, comment and white papers submitted by our members. Our information includes various data science courses available and offered by different universities and online data science course administrators as well as information on data science training & certification. We are recruiting editors and contributors to work with us on this site. Get in touch.

BIG DATA

We encourage everyone who works in big data or with data scientists to join us and share their knowledge and experiences with the community. We provide support and network opportunities to data science practitioners and managers buying advanced analytical services.