Granular computing with multiple granular layers for brain big data processing

Wang, Guoyin; Xu, Ji

doi:10.1007/s40708-014-0001-z

Article
Open access
Published: 06 September 2014

Granular computing with multiple granular layers for brain big data processing

Guoyin Wang^1,2 &
Ji Xu^1,3

Brain Informatics volume 1, pages 1–10 (2014)Cite this article

4901 Accesses
19 Citations
Metrics details

Abstract

Big data is the term for a collection of datasets so huge and complex that it becomes difficult to be processed using on-hand theoretical models and technique tools. Brain big data is one of the most typical, important big data collected using powerful equipments of functional magnetic resonance imaging, multichannel electroencephalography, magnetoencephalography, Positron emission tomography, near infrared spectroscopic imaging, as well as other various devices. Granular computing with multiple granular layers, referred to as multi-granular computing (MGrC) for short hereafter, is an emerging computing paradigm of information processing, which simulates the multi-granular intelligent thinking model of human brain. It concerns the processing of complex information entities called information granules, which arise in the process of data abstraction and derivation of information and even knowledge from data. This paper analyzes three basic mechanisms of MGrC, namely granularity optimization, granularity conversion, and multi-granularity joint computation, and discusses the potential of introducing MGrC into intelligent processing of brain big data.

1 Introduction of big data and data science

To gain an insight of philosophy into the nature of brain data and the significance of processing it, we would firstly introduce a broad view on some related concepts such as physical space, social space, data space, natural sciences, social sciences, and data sciences.

There have been for a long history the physical space and social space to describe the phenomena in natural world and human society, respectively, and the research on the spaces leads to natural science and social science. In recent years, the ubiquitous digitalization of both natural world and human society has produced huge amount of data. Along with “big data” becoming a hot topic for researchers, entrepreneurs and government officials, people realize that a data space has come into existence.

The connection and interaction among people is the one of the key sources of human intelligence; in other words, the interactions of elements in social space produce human intelligence. So, similarly, it is expected that the relations and interactions of entities in data space would produce other forms of intelligence such as machine intelligence and web intelligence [1].

The data space is “relatively independent” of physical space and social space, since it remains stable in a way despite being a reflection of them. Once the data have been generated, they will not evolve accordingly as the described objects change if no special mechanism is arranged. One dataset as a mirror of entities from natural world or human society would yield new results if interacted with others, and then the results may have reaction on natural world or human society with the assistance of automatic control devices or human beings. The data may have powerful reaction to the real world even if it is fabricated, e.g., rumors spread via mobile phones and the Internet played a vicious role in the London riot 2011 [2].

It would be agreed that research on data space will lead to data science which has many differences from natural and social science with respect to research objectives, research methodologies, and technologies. In some circumstances, data science can be used interchangeably with big data [3]. To get the best out of big data, funding agencies should develop shared tools for optimizing discovery and train a new breed of researchers, says Mattmann [4]. Data Science need not be always for big data; however, the fact that data are scaling up makes big data an important aspect of data science [3].

“Big data” is the most highlighted term in the past 2 years, and it can be expected with much confidence that it would continue to be popular in the next a few years for its promising utility in many fields such as commerce and business, biology, public administration, material science, and cognition in human brain, just to name a few. People from the society of academia, industry, and the open source community have done a lot of work concerning big data analytics.

The studies of big data by academia society could be classified into two categories: basic researches and application researches.

The basic researches of big data are about basic concepts, rules, procedures, and so on. Fisher discussed the challenges lying in the interactions in big data analytics [5]. A community white paper developed by leading researchers across the United States discussed the application of big data in several typical fields and proposed a data analysis pipeline [6]. Recently, Wu presented a HACE theorem that characterizes the features of the big data revolution, and proposed a 3-tiered big data processing model [7]. A close-up view about big data was demonstrated by Chen and Zhang, which included applications, opportunities, and challenges of big data; the state-of-the-art techniques and technologies; as well as several underlying methodologies to handle the data deluge [8]. Han presented a novel skyline algorithm on big data showing significant advantage over the existing skyline algorithms [9], and there are many other researches falling into this category such as [10–13].

Application researches on big data refer to the applications of big data analytics in many different fields. In commerce and business, Chen introduced in detail the evolution of business intelligence, analytics, and the impact of big data in typical areas [14]. In biology, powerful computers and numerous tools for data analysis is crucial in drug discovery and other areas, and biologists get neither their feet nor their hands wet [15]. In public administration, the Trento big data platform offers the service of representing the mean availability of cars in regions of Munich at noon, which can be easily used to improve customer satisfaction, by identifying bottlenecks [16]. In materials science, advances in data analysis have placed the field on the verge of a revolution in how researchers conduct their work, analyze properties and trends in their data, and even discover new materials [17].

There are also quite a few research works which address some challenges in big data analytics with keywords like “huge data,” “large scale dataset,” and “high speed streaming data,”, but no “big data”. These works surely should be noticed and appreciated by big data researchers and practitioners [18–20].

The international IT giants such as Google, IBM, Microsoft, Oracle, and EMC have developed their own big data solution systems and platforms, which are Dremel, InfoSphere BigInsights and InfoSphere Streams, HDInsight, ExaData, Greenplum and so forth [21–26]. Most of the big data platforms are based on Hadoop. Apache also supports other projects related to Hadoop such as HBase, Hive, Pig, Mahout, and Spark, each of which has special effect in dealing with different challenging aspects in big data processing (BDP) [27]. In addition to the projects supported by Apache, there are other open source big data projects, such as Cloudera Impala [28] and RHIPE [29].

The rest of the paper is organized in the following fashion. Section 2 discusses brain big data and its applications. Section 3 introduces the three mechanisms of MGrC and discusses their relationship with five major theoretical models of MGrC. Some key issues of BDP based on MGrC are also analyzed in this section. In Sect. 4, we propose the potential of using MGrC to explore brain big data. The conclusions are drawn in Sect. 5.

2 Brain big data

Among the methods of generating data from natural world and human society, using equipments of fMRI, EEG, and MEG to collect brain data is of great concern from the interdisciplinary researchers of computing, neuroscience, and cognitive psychology [30]. Because the techniques of noninvasive studies of human brain function have been in widespread use to detect metabolic activity and neuronal activity throughout the brain of different subjects all around the world, huge amount of complex datasets are collected every day. There is no doubt that the brain data are a significant category of big data, which hold great potential to unlock mysteries of the human mind [31].

Brain data are in the forms of pulse curve, 2D images [32], and 3D structures reconstructed from 2D images [33], as shown in Fig. 1. Pulse curves are generated by EEG; 2D images are produced by fMRI, MEG, OCT, etc., and 3D structures are reconstructed from 2D images using computer graphics technology. Furthermore, 4D models of the brain can be based on imaging and modeling its 3D structure at a sequence of time-points [33]. We can see that brain data are more complex than regular information tables, which lead to difficulties in modeling and processing of them.

Researches on brain data can achieve a new understanding of the brain, new treatments for brain diseases (such as Alzheimer’s and Parkinson’s [34]), and new brain-like computing technologies [35]. The significance of brain data research had been realized so clearly that governments of the EU and USA started their own brain projects [35, 36]. There have been some successful researches on this field. Ryali described a novel method based on logistic regression using a combination of L1 and L2 norm regularization to identify relevant discriminative brain regions and accurately classify fMRI data [37]. Zhong and Chen proposed Data-Brain, a new conceptual model of brain data, to explicitly represent various relationships among multiple human brain data sources, with respect to all major aspects and capabilities of human information processing systems [32].

3 Multi-granularity computing for big data

“GrC is a superset of the theory of fuzzy information granulation, rough set theory and interval computations, and is a subset of granular mathematics,” stated Zadeh in 1997. Granules are any subsets, classes, objects, clusters, and elements of a universe as they are drawn together by distinguishability, similarity, or functionality [38]. Yao considers GrC to be a label of theories, methodologies, techniques, and tools that make use of granules in the process of problem solving [39]. GrC has become one of the fastest growing information processing paradigms in the domain of computational intelligence and human-centric systems [38]. There are two fundamental issues in GrC: granulation and granular structure. Different semantic aspects and algorithm aspects of granulation will lead to different granular structures of the universe. Chen defined five classes of modal-style operators to construct granular structure and hierarchical structure of data based on the lattice of concepts [40].

Evolved from GrC, MGrC emphasizes jointly utilizing multiple levels of information granules (IG) in problem solving, instead of considering only one optimal granular layer.

3.1 Three basic mechanisms and five theoretical models of MGrC

MGrC considers multiple levels of IG when solving a problem, and there have been a lot researches in this regard [41–45, 62–69]. Three basic mechanisms of MGrC can be summarized from these research works with regard to the way in which multi-granular levels are used in problem solving. They are granularity optimization, granularity conversion, and multi-granularity joint computation. In granularity optimization, the most suitable granular level of a domain is chosen for the multi-granular information/knowledge representation model (MGrR), and the most efficient and satisfactory enough solution is generated on it [41–43]. Granularity conversion means the working granularity layer will be switched between adjective layers or jump to a higher or lower granular layer, in accordance with the requirements of solving a problem [44, 45]. Multi-granularity joint computation takes a problem-oriented MGrR as input, and every layers of the MGrR are employed jointly to achieve a correct solution to the problem. Each of the three mechanisms has its particular type of problem to deal with.

The three basic mechanisms is a new perspective on GrC. Then, what is the relationship between the three mechanisms and models to implement GrC such as fuzzy set, rough set, quotient space, cloud model, and deep learning? We will see that some models suit certain mechanisms better, which are to be introduced in detail as follows.

3.1.1 Granularity optimization

The theories of fuzzy set and rough set are good choices for the mechanism of granularity optimization.

The fuzzy set theory presented by Zadeh in 1965 starts with definitions of membership function, with the more functions defined about an attribute, the attribute is granulated into the finer fuzzy IG. The reason for fuzzy IG is that crisp IG (e.g., an interval is partitioned by exact values) does not reflect the fact that the granules are fuzzy in almost all of human reasoning and concept formation [46, 47]. The number of concepts formed through fuzzy granulation reflects the corresponding granularity being relatively fine or coarse, and decision on the number is an application-specific optimization problem.

The rough set theory developed by Pawlak in 1982 is an effective model to acquire knowledge in information system with upper approximation and lower approximation as its core concepts, making decisions according to the definition of distinguishable relation and attribute reduct. Researchers of related fields have made great variety of improvements to the classic rough set theory mainly by redefining the distinguishable relation and approximation operators [48–50], and integrated it with other knowledge acquisition models, which yield rough neural computation [51], rough fuzzy set and fuzzy rough set [52], and so on.

Rough set can be used to granulate a set of objects into IGs. The grain size of the IG is determined by how many attributes and how many discrete values each attribute takes in the subset of the whole attribute set, which is selected to do the granulation. Generally, the more attributes and the more values each attribute takes, the finer the resulting IGs.

In the perspective of knowledge transformation [53], the process of data analyzing and problem solving by fuzzy sets or rough sets is actually to find a mapping from the information represented by the original finest-grained data to the knowledge hidden behind a set of optimized coarser and more abstract IGs.

3.1.2 Granularity conversion

The quotient space theory proposed by Zhang is a model for problem solving with the basic idea of conceptualizing the world at different granularities and shifting the focus of thinking onto a different abstract level [54, 55]. It is not hard to tell that quotient space is meant to solve problems with need of granularity conversion. In the quotient space theory, a problem space is described by a triplet (X, f, T) with X as its domain, f as its attributes, and T its structure. Suppose R is an equivalence relation on X, [X] is a quotient set under R. Taking [X] as a new domain, we have a new problem space ([X], [f], [T]). The worlds with different granularities are represented by a set of quotient spaces. Based on the descriptions, the construction of different-grain-sized quotient spaces and problem solving on these spaces are researched [55].

The quotient space theory has attracted the attention of researchers from the fields of information science, automatic control, and applied mathematics [56, 57]. Integrating the idea of fuzzy mathematics into quotient space theory, Zhang proposed fuzzy quotient space theory subsequently, which provides a powerful mathematical model and tool for GrC [58, 59]. Fuzzy quotient space theory introduces fuzzy equivalence relation into the construction of quotient space, in which different threshold values of the membership function will lead to quotient spaces of different grain sizes. By setting different threshold values, an MGrR can be derived.

The cloud model proposed by Li realizes the uncertain transformation between qualitative concepts and quantitative values and can be further used to realize the bidirectional cognition, i.e., from concept intension to extension and from extension to intension [60], as shown in Fig. 2. Since a concept definitely has the property of granularity, mapping of quantitative values to a suitable grain-sized qualitative concept is also the process of granularity optimization.

Inspired by the idea of MGrC, Liu constructed an MGrR using cloud model with an Adaptive Gaussian Cloud Transformation (A-GCT) algorithm [61]. Multi-granular concepts are generated by clustering academicians of Chinese Academy of Engineering (ACAE) with regard to age based on the definition of parameter concept clarity, as shown in Fig. 3.

Therefore, granularity conversion can be implemented using cloud model with A-GCT algorithm and a set of different values of parameter concept clarity.

3.1.3 Multi-granularity joint computation

Deep learning is a breakthrough of learning by neural networks in recent years. Starting with Hiton publishing his research on Science magazine in 2006, whose major contribution is using deep auto-encoder networks to learn low-dimensional codes for data of high dimensionality [62], other research works of closely related topics are reported afterward [63–65]. The structure of restricted Boltzmann machines (RBMs) and a deep belief network based on RBMs are shown in Fig. 4. Deep learning also draws the attention from IT industry. Researchers from Google and Stanford University consider the problem of building high-level, class-specific feature detectors from only unlabeled data, and train a face detector on a cluster with 1,000 machines (16,000 cores) for three days, without having to label images as containing a face or not. The experiment obtains 15.8 % accuracy, a leap of 70 % relative improvement over the previous state-of-the-art [66]. Deep learning was selected as the first of the 10 breakthrough technologies 2013 [67].

The core idea of deep learning is training a deep architecture with many layers of neural network, and the constraints between the adjacent layers are set beforehand. Although deep learning does not make the most popular topic in machine learning until 2006, the research with the similar idea can be traced back to 1990s, for example, Jang presented an adaptive-network-based fuzzy inference system (ANFIS) to implement fuzzy inference with a 5-layered neural network [68], as depicted in Fig. 5. Using a hybrid learning procedure, ANFIS can construct an input–output mapping based on both human knowledge (in the form of fuzzy if–then rules) and stipulated input–output data pairs. And we proposed a triple-valued or multiple-valued logic neural network (TMLNN) to represent and process triple-valued or multiple-valued logic knowledge using neural network [69], as illustrated in Fig. 6. The fundamental element of TMLNN is a novel neuron model, triple-valued or multiple-valued logic neuron (TMLN). Each TMLN can represent a triple-valued or multiple-valued logic rule by itself. There are two triple-valued or multiple-valued logic neurons: TMLN-AND (triple-valued or multiple-valued “logic and” neuron) and TMLN-OR (triple-valued or multiple-valued “logic or” neuron).

The application of deep learning to pattern recognition reflects that human firstly takes as input the pixels of an object projected onto the retina and then detects the edges of the object, recognizes the parts of it, and finally forms the high-level abstract concept of the object [70], as shown in Fig. 7a. This means that visual concept formation procedure is local-to-global, which seems to be contradictory to the topologically global first visual perception theory developed by Chen in 1982 [71]. According to the results of experiments conducted by Chen’s team, the human visual system is sensitive to global topological properties, as illustrated in Fig. 7b.

However, after a careful analysis, we realize that the two theories are NOT contradictory. In fact, they reflect the different facets of human visual cognition. Chen’s theory focuses on the last phase of the whole visual concept formation, since the experiments are conducted using noninvasive measurement on human brain cortex. However, visual concept formation in deep learning considers all the organs of visual system and the whole perception process.

What can be learnt from Chen’s global first theory and deep learning from the standpoint of MGrC for BDP is that original finest-grained data (compared to the pixels projected on retina) are the information source for sure, but we should not stick to it. Exploiting components of higher level abstraction (compared to edges and parts) and the relation among them (compared to the topological relations of visual stimuli) is helpful to efficient problem solving.

Deep learning itself is a typical model of multi-granularity joint computation, and it could be expanded to a more general structure for multi-granularity joint computation (MGrJC). Major differences between MGrJC and deep learning are that the input of deep learning is the finest-grained data when MGrJC takes an MGrR as its input, and a layer-wise learner of deep learning is usually a neural network when MGrJC intends to generalize it to any type of learning model.

Although we introduce fuzzy set and rough set in section granularity optimization, and quotient space and cloud model in section granularity conversion, this does not mean that the GrC models are limited to the corresponding mechanisms. Actually, fuzzy set and rough set could be used in granularity conversion, and quotient space and cloud model could be used in granularity optimization as well. The relationship between the three mechanisms and five models is summarized in Table 1.

Table 1 Applicability of the MGrC models to the three MGrC mechanisms

Full size table

3.2 Key issues for BDP

There are quite a few issues that remain unaddressed despite much effort having been made to BDP, among which some are caused by the same reason: always getting start analytics from the original or finest-grained data.

3.2.1 Issue 1: Lacking BDP models of human level machine intelligence (HLMI)

The founder of Fuzzy Set theory, Zadeh argues that the precisiated natural language computing, which originated from CW (computing with words), is the cornerstone of HLMI [72]. The current BDP models fail to simulate human thinking to grasp the proper granularity of information when solving a problem, and thus consequently lose the opportunity to build human-centric data processing systems. The research team led by Chen founded the topologically “global first” visual perception theory in 1982 [71]. Always dealing data from the finest granularity does not accord with this human perception law.

3.2.2 Issue 2: Lacking measures to effectively reduce the size of data in BDP

Volume is the most highlighted challenge when compared to other aspects in BDP, and many difficulties are directly caused by it. To cope with this problem, a straightforward idea is reducing the data size but preserving as much as possible its information, which could avoid excessive reliance on the finest-grained data and reduce the cost in storage and communication.

3.2.3 Issue 3: Lacking the methods to offer effective solution to big data problems with various constraints

There are some situations where a user does not insist on precise answer to a particular problem regarding BDP, since a coarser-grained imprecise result would make him/her happy enough. There are other situations where the precise answer is not available in time due to the problem complexity, data amount and complexity, and the capacity of computing and communication, but if the problem granularity is shifted to a coarser granular level, an imprecise yet acceptable result may be obtained in time. Therefore, it is necessary to introduce a term “effective solution”, which means that the solution meets the requirements of the user regarding granularity and timeliness simultaneously, or in other words, that the solution has a fine enough granularity with respect to the user’s quest and it is delivered in time.

MGrC are able to tackle the issues listed above. For Issue 1, computation with information described in natural language ultimately reduces to computation with granular values, which is the province of GrC [72]. Therefore, MGrC will help BDP move toward HLMI. For Issue 2, multi-granular representation of original data is a form of simplification or abstraction; hence the considerable reduction in data volume can be realized. And when it comes to Issue 3, the most highlighted feature of employing MGrC in BDP is that it can manage to offer effective solution under various constraints.

4 MGrC for brain big data

As mentioned in Sect. 2, the targets for brain BDP achieve a new understanding of the brain, new treatments for brain disease, and new brain-like computing technologies. So the targets are mainly qualitative rather than quantitative, that is, we do not need a solution of precise value or mathematic function, but a result that can be described with words. This is the very province of MGrC.

There have been some related works on pulse signal processing and remote sensing images with GrC methodology, from which the future research on processing brain big data with MGrC can benefit a lot. For example, Gacek and Pedrycz developed a general framework of a granular representation of ECG signals [73], which share many common features with the EEG form of brain data. Furthermore, Gacek recently discussed the granular representation of time series with a number of representation alternatives and the question of forming adjustable temporal slices, and presented an optimization criterion of a sum of volumes of IG [74]. Meher and Pal presented a new rough-wavelet granular space-based model for land cover classification of multispectral remote sensing image [75], which can be used for reference to analyze 2D brain image data.

The three mechanisms of MGrC have great potentials for brain BDP in three facets. Firstly, the brain big data is of multi-granular in nature. As shown in Fig. 8a, EEG signals can be granulated like other kinds of time series [74] and be processed subsequently. For 2D and 3D brain data, they can be viewed hierarchically considering two factors. One is the organizational granularity of brain, e.g., we could view the brain in this hierarchy: whole brain → lobes → gyrus and sulcus → neurons, with lobes, gyrus and sulus shown in Fig. 8b, c, respectively. The other is the measurement granularity of activation degrees in particular region of the subject’s brain, e.g., we could granulate the activation degrees as follows: strongest → very strong → strong → weak → very weak.

Secondly, the computation performed on brain big data needs to be multi-granular and produce results of variable precision. As previously mentioned, the targets for brain BDP could be described with words; thus, they are of multi-granularity. For example, researches on the cure of a kind of brain disease may focus on the changes of certain gyrus and sulcus, while another kind of brain disease needs the neurons of temporal lobe to be investigated. Therefore, granularity optimization mechanism is useful for the former disease and granularity conversion is useful for the latter. And if the brain disease is the result of multiple causes, then multi-granularity joint computation may be required.

Thirdly, identify the proof or signs of the granular thinking in human brain, and offer valuable inspiration to computing technologies. The existence of granular thinking of human beings is already a common sense shared by the cognition and computing society, but to our best knowledge, the process of granularity optimization, granularity conversion, or MGrJC in human thinking has not been explicitly depicted by the equipments of fMRI, EEG, MEG, etc. Therefore, many details of the granular thinking in human brain still remain unknown. Using MGrC to identify and interpret the MGrC occurring in human brain is meaningful for future work.

5 Conclusion

In this paper, we firstly review data space, data science, and researches on BDP, and talk about the source, form, significance, and research works of brain big data. We propose the three mechanisms of MGrC and discuss their relationship with five major models of MGrC, i.e., fuzzy set, rough set, quotient space, cloud model, and deep learning. We also discussed the key issues of current BDP and the reasons why MGrC can tackle them. Then we propose the potential of exploring brain big data with MGrC. Future research may include representing the brain big data from real world with MGrR and conducting intelligent computation based on it to offer effective solution to the problems to do successful research in brain BDP.

References

Zhong N, Liu JM, Yao YY, Wu JL, Lu SF, Qin YL, Li KC, Wah B (2007) Web intelligence meets brain informatics. In: Web intelligence meets brain informatics. Springer, Berlin, pp 1–31
BBC News—England riots: Dangers behind false rumors (2011) http://www.bbc.co.uk/news/uk-14490693
Wikipedia (2014) Data science. http://en.wikipedia.org/wiki/Data_science
Mattmann CA (2013) Computing: a vision for data science. Nature 493:473–475
Article Google Scholar
Fisher D, DeLine R, Czerwinski M et al (2012) Interactions with big data analytics. Interactions 19:50–59
Article Google Scholar
Agrawal D, Bernstein P, Bertino E et al (2012) Challenges and opportunities with big data. http://cra.org/ccc/docs/init/bigdatawhitepaper.pdf
Wu XD, Zhu XQ, Wu GQ, Ding W (2014) Data mining with big data. IEEE Trans Knowl Data Eng 26:97–107
Article Google Scholar
Chen CLP, Zhang CY (2014) Data-intensive applications, challenges, techniques and technologies: a survey on big data. Inf Sci. doi:10.1016/j.ins.2014.01.015
Han XX, Li JZ, Yang DH, Wang JB (2013) Efficient skyline computation on big data. IEEE Trans Knowl Data Eng 25:2521–2535
Article Google Scholar
Havens TC, Bezdek JC, Leckie C, Hall LO, Palaniswami M (2012) Fuzzy c-means algorithms for very large data. IEEE Trans Fuzzy Syst 20:1130–1146
Article Google Scholar
Lu JG, Li DD (2013) Bias correction in a small sample from big data. IEEE Trans Knowl Data Eng 25:2658–2663
Article Google Scholar
Sun WQ, Li FQ, Jin YH, Hu WS (2013) Store, schedule and switch—a new data delivery model in the big data era. In: Proceedings of IEEE 15th international conference on transparent optical networks (ICTON’13), pp 1–4
Zhang LQ et al (2013) Moving big data to the cloud: an online cost-minimizing approach. IEEE J Sel Area Commun 31:2710–2721
Article Google Scholar
Chen H, Chiang RHL, Storey VC (2012) Business intelligence and analytics: from big data to big impact. MIS Q 36:1165–1188
Google Scholar
Marx V (2013) Biology: the big challenges of big data. Nature 498:255–260
Article Google Scholar
Bedini I, Elser B, Velegrakis Y (2013) The Trento big data platform for public administration and large companies: use cases and opportunities. In: Proceeding of VLDB endowment, vol 6, pp 1166–1167
White AA (2013) Big data are shaping the future of materials science. MRS Bull 38:594–595
Article Google Scholar
Raykar VC, Duraiswami R, Krishnapuram B (2008) A fast algorithm for learning a ranking function from large-scale data sets. IEEE Trans Pattern Anal 30:1158–1170
Article Google Scholar
Yan J et al (2006) Effective and efficient dimensionality reduction for large-scale and streaming data preprocessing. IEEE Trans Knowl Data Eng 18:320–333
Article Google Scholar
Wu XD, Yu K, Ding W, Wang H, Zhu XQ (2013) Online feature selection with streaming features. IEEE Trans Pattern Anal 35:1178–1192
Article Google Scholar
Melnik S, Gubarev A, Long JJ, Romer G, Shivakumar S, Tolton M, Vassilakis T (2010) Dremel: interactive analysis of web-scale datasets. In: Proceedings of VLDB Endowment, vol 3, pp 330–339
IBM White Paper (2013) IBM InfoSphere streams—redefining real-time analytics processing. http://www-01.ibm.com/common/ssi/cgi-bin/ssialias?infotype=SA&subtype=WH&htmlfid=IMW14704USEN
IBM White Paper (2013) Using IBM InfoSphere BigInsights to accelerate big data time-to-value. http://public.dhe.ibm.com/common/ssi/ecm/en/imw14684usen/IMW14684USEN.PDF
Microsoft (2014) HDInsight service.http://www.windowsazure.com/en-us/services/hdinsight/
Oracle (2014) Oracle Exadata database machine. http://www.oracle.com/us/products/database/exadata/overview/index.html
EMC (2014) EMC Greenplum data computing appliance. http://www.emc.com/collateral/hardware/data-sheet/h7419-greenplum-dca-ds.pdf
Welcome to Apache Hadoop. http://hadoop.apache.org/
Cloudera (2013) Cloudera Impala. http://www.cloudera.com/content/cloudera/en/products-and-services/cdh/impala.html
Department of Statistics of Purdue University (2012) Divide and Recombine (D&R) with RHIPE. http://www.datadr.org/
Zhong N, Bradshaw JM, Liu JM, Taylor JG (2011) Brain informatics. IEEE Intell Syst 26:16–21
Article Google Scholar
Turk-Browne NB (2013) Functional interactions as big data in the human brain. Science 342:580–584
Article Google Scholar
Zhong N, Chen J (2012) Constructing a new-style conceptual model of brain data for systematic brain informatics. IEEE Trans Knowl Data Eng 24:2127–2142
Article Google Scholar
Toga AW, Thompson PM (2001) Maps of the brain. Anat Rec 265:37–53
Article Google Scholar
Michael K, Mille KW (2013) Big data: new opportunities and new challenges. IEEE Comput 46:22–24
Article Google Scholar
The Human Brain Project—A Report to the European Commission (2012) https://www.humanbrainproject.eu/documents/10180/17648/TheHBPReport_LR.pdf/18e5747e-10af-4bec-9806-d03aead57655
White House (2013) Fact Sheet: BRAIN initiative. http://www.whitehouse.gov/the-press-office/2013/04/02/fact-sheet-brain-initiative
Ryali S, Supekar K, Abrams DA, Menon V (2010) Sparse logistic regression for whole-brain classification of fMRI data. NeuroImage 51:752–764
Article Google Scholar
Yao JT, Vasilakos AV, Pedrycz W (2013) Granular computing: perspectives and challenges. IEEE Trans Cybern 43:1977–1989
Article Google Scholar
Yao YY (2000) Granular computing: basic issues and possible solutions. In: Proceedings of 5th joint conference on information sciences, vol 1, Atlantic, pp 186–189
Chen YH, Yao YY (2008) A multiview approach for intelligent data analysis based on data operators. Inf Sci 178:1–20
Article MATH MathSciNet Google Scholar
Nakatsuji M, Fujiwara Y (2014) Linked taxonomies to capture users’ subjective assessments of items to facilitate accurate collaborative filtering. Artif Intell 207:52–68
Article MathSciNet Google Scholar
Pedrycz W (2014) Allocation of information granularity in optimization and decision-making models: towards building the foundations of granular computing. Eur J Oper Res 232:137–145
Article Google Scholar
Pedrycz W, Homenda W (2013) Building the fundamentals of granular computing: a principle of justifiable granularity. Appl Soft Comput 13:4209–4218
Article Google Scholar
McCalla G, Greer J, Barrie B et al (1992) Granularity hierarchies. Comput Math Appl 23:363–375
Article MATH Google Scholar
Zhu P, Hu Q (2013) Adaptive neighborhood granularity selection and combination based on margin distribution optimization. Inf Sci 249:1–12
Article MathSciNet Google Scholar
Zadeh LA (1997) Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzz Sets Syst 90:111–127
Article MATH MathSciNet Google Scholar
Zadeh LA (2008) Is there a need for fuzzy logic? Inf Sci 178:2751–2779
Article MATH MathSciNet Google Scholar
Wang GY, Yao YY, Yu H (2009) A survey on rough set theory and applications. Chin J Comput 32:1229–1246
Article MathSciNet Google Scholar
Wang GY (2002) Extension of rough set under incomplete information systems. In: Proceedings 2002 IEEE international conference on fuzzy systems, vol 2, pp 1098–1103
Wang GY (2003) Rough reduction in algebra view and information view. Int J Intell Syst 18:679–688
Article MATH Google Scholar
Peters JF, Szczuka MS (2002) Rough neurocomputation: a survey of basic models of neurocomputation. In: Proceedings of 3rd international conference (RSCTC 2002), Malvern, PA, pp 308–315
Dubois D, Prade H (1990) Rough fuzzy sets and fuzzy rough sets. Int J Gen Syst 17:191–209
Article MATH Google Scholar
Wang GY, Wang Y (2009) 3DM: domain-oriented data-driven data mining. Fund Inf 90:395–426
MATH Google Scholar
Zhang B, Zhang L (2007) Theory of problem solving and its applications, 2nd edn. Tsinghua University Press, Beijing in Chinese
Google Scholar
Zhang L, Zhang B (2004) The quotient space theory of problem solving. Fund Inf 59:287–298
MATH Google Scholar
Wu D, Ban XJ, Oquendo F (2012) An architecture model of distributed simulation system based on quotient space. Appl Math 6:603S–609S
Google Scholar
Zhang C, Zhang Y, Wu XP (2011) Audio signal blind deconvolution based on the quotient space hierarchical theory. In: Rough sets and knowledge technology. Springer, Berlin, pp 585–590
Zhang L, Zhang B (2003) Theory of fuzzy quotient space (methods of fuzzy granular computing). J Softw 14:770–776 (in Chinese with English abstract)
MATH Google Scholar
Zhang L, Zhang B (2005) Fuzzy reasoning model under quotient space structure. Inf Sci 173:353–364
Article MATH Google Scholar
Li DY et al (2008) Artificial intelligence with uncertainty. Chapman &Hall/CRC Press, Boca Raton
Liu YC, Li DY, He W, Wang GY (2013) Granular computing based on gaussian cloud transformation. Fund Inf 127:385–398
Google Scholar
Hinton GE, Salakhutdinov RR (2006) Reducing the dimensionality of data with neural networks. Science 313:504–507
Article MATH MathSciNet Google Scholar
Bengio Y (2009) Learning deep architectures for AI. Found Trends Mach Learn 2:1–127
Article MATH Google Scholar
Bengio Y, Lamblin P, Popovici D et al (2007) Greedy layer-wise training of deep networks. Adv Neural Inf Process Syst 19:153
Google Scholar
Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35:1798–1828
Article Google Scholar
Le QV, Ranzato MA, Monga R et al (2011) Building high-level features using large scale unsupervised learning. arXiv preprint. arXiv:1112.6209
Breakthrough Technologies (2013) http://www.technologyreview.com/lists/breakthrough-technologies/2013/
Jang JSR (1993) ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans SMC 23:665–685
Google Scholar
Wang GY, Shi HB (1998) TMLNN: triple-valued or multiple-valued logic neural network. IEEE Trans Neural Netw 9:1099–1117
Article Google Scholar
Lee H, Grosse R, Ranganath R, Ng AY (2009) Convolutional deep belief networks for scalable unsupervised learning of hierarchical representations. In: Proceedings of ICML 2009. ACM, New York, pp 609–616
Chen L (1982) Topological structure in visual perception. Science 218:699
Article Google Scholar
Zadeh LA (2008) Toward human level machine intelligence-is it achievable? The need for a paradigm shift. IEEE Comput Intell Mag 3:11–22
Article Google Scholar
Gacek A, Pedrycz W (2006) A granular description of ECG signals. IEEE Trans Biomed Eng 53:1972–1982
Article Google Scholar
Gacek A (2013) Granular modelling of signals: a framework of granular computing. Inf Sci 221:1–11
Article Google Scholar
Meher SK, Pal SK (2011) Rough-wavelet granular space and classification of multispectral remote sensing image. Appl Soft Comput 11:5662–5673
Article Google Scholar

Download references

Acknowledgments

This work is supported by the National Science Foundation of China (No. 61073146 and No. 61272060) and the Natural Science Foundation Key Project of Chongqing of P.R. China under the Grant No. CSTC2013jjB40003.

Author information

Authors and Affiliations

School of Information Science & Technology, Southwest Jiaotong University, Chengdu, 610031, China
Guoyin Wang & Ji Xu
Key Laboratory of Computational Intelligence, Chongqing University of Posts and Telecommunications, Chongqing, 400065, China
Guoyin Wang
Information Engineering School, Guizhou University of Engineering Science, Bijie, 551700, Guizhou, China
Ji Xu

Authors

Guoyin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ji Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ji Xu.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Wang, G., Xu, J. Granular computing with multiple granular layers for brain big data processing. Brain Inf. 1, 1–10 (2014). https://doi.org/10.1007/s40708-014-0001-z

Download citation

Received: 17 April 2014
Accepted: 08 July 2014
Published: 06 September 2014
Issue Date: December 2014
DOI: https://doi.org/10.1007/s40708-014-0001-z

Granular computing with multiple granular layers for brain big data processing

Abstract

1 Introduction of big data and data science

2 Brain big data

3 Multi-granularity computing for big data

3.1 Three basic mechanisms and five theoretical models of MGrC

3.1.1 Granularity optimization

3.1.2 Granularity conversion

3.1.3 Multi-granularity joint computation

3.2 Key issues for BDP

3.2.1 Issue 1: Lacking BDP models of human level machine intelligence (HLMI)

3.2.2 Issue 2: Lacking measures to effectively reduce the size of data in BDP

3.2.3 Issue 3: Lacking the methods to offer effective solution to big data problems with various constraints

4 MGrC for brain big data

5 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords