Nowadays organizations are starting to realize the importance of using more data in order to support decision for their strategies. The size of data in world is growing day by day. Data is growing because of vast use of internet, smart phone and social network. Big data is a collection of data sets which is very large in size as well as complex. Generally size of the data is Petabyte and Exabyte. Traditional database systems are not able to capture, store and analyze this large amount of data. As the internet is growing, amount of big data continue to grow. Big data analytics provide new ways for businesses and government to analyze unstructured data. Nowadays, Big data is one of the most talked topic in IT industry. It is going to play important role in future. Big data changes the way that data is managed and used. Some of the applications are in areas such as healthcare, defense, traffic management, banking, agriculture, retail, education and so on. Organizations are becoming more flexible and more open. New types of data will give new challenges as well.
The biggest challenge of genetic research lies in significant and intellectual analysis of the large and complex data sets generated by the cutting edge techniques like massively parallel DNA sequencing and genome wide analysis. Statistical analyses are the most important of such experimental data. When the data are not normally distributed and using non numerical (rank, categorical) data then use the nonparametric test for exact result of research hypothesis. Order statistics are among the most fundamental tools in non-parametric statistics and inference. Non parametric test does not depend upon parameters of the population from which the samples are drawn, no strict assumption about the distribution of the population. Nonparametric tests are known as distribution free test also because their assumptions are less and weaker than those connected with parametric test. Nonparametric test does not follow probability distribution. To analyze microarrays and genomics data several non-parametric statistical techniques are used like Wilcoxon’s signed rank test (pre-post group),Mann-Whitney U test (two groups) or Kruskal-Wallis test (two or more groups).Importance of this paper is to look at the non-parametric test how to use in genetic research and provide the understanding of these test
This paper discusses the role of the data scientist, what a data scientist is, and the set of skills needed to become one.
Ultimately, the widespread adoption of automation and robotics, as well as the rise of artificial intelligence will have profound impacts on the world, particularly in terms of human employment and even the global economy, such as the questions asked by Martin Ford, in his book, Rise of the Robots. Martin Ford suggests the outlook is bleak for millions of workers who define their self-worth in terms of their employment. Increasingly workers will no longer be exploited by those in control of capital and intelligent machines, they will be irrelevant to them.