Student's Blog | Suman Kundu

Big Data Challenges in Cyber Security

Abstract With the recent development in Computer Technology there is rapid increase in amount of data which is stored on Internet, on social network, bank-data, organization's data etc. With this increase of size of data there are some challenges also arising to safeguard this data and provide privacy to holders of the data[1] Introduction In the recent years the size of data in various sectors online like social Networking, E-commerce, etc is increasing and it can not be stored or manipulated using traditional Database Management, thus leading us to Big data Technology.

Merkle Tree

Introduction Merkle Tree [1] is an authenticated data structure, also known as Merkle hash Tree. It is an authenticated binary search tree.It is a fundamental part of the blockchain. It uses hash functions like MD5, SHA, etc. In the Fig1. Shows a hash binary tree, so the leaf nodes contain the hash value of the block of transaction. The internal node contains the hash value of the concatenated value of leaf nodes.

Handling heterogeneous data in Big Data

As we know that now a days Big Data is very hot topic among the IT field guys. We can also say that Big Data is emerging problem which is growing day-by-day. Big Data results from verious sources like: medical sector, industrial sector, research and development sector, etc. Many big IT giants such as google, facebook, amazon, etc. invests a lot of resources like manpower, machines and energy in this field.

Retrieval of indexed multilingual Information

Introduction Nowadays the data from various sources like database,internet,social media,etc are very huge in size. For retrieving data relevant to a given query requires fetching information from this huge data which is a difficult task without proper information retrieval technique. Information retrieval is the science of efficiently retrieving information from large data which may be a document,a database of texts,images or sounds.To enhance information retrieval indexing needs to be used on data.

Representation of words of big text data in vector space

Introduction: Many research works have been proposed to extract knowledge out of text documents because lots of data is availabale in text format only. Extracting knowledge includes topic detection, translation of document, sentiment analysis. This is done by converting the text documents into vector representations. There are few methods of converting the words to vectors like count vectoriser, TF-IDF vectoriser etc. But the above methods mentioned doesnot preserve semantic relationship among the words.

What, When and How of Big Data Visualization

What is VISUALIZATION? Data visualization is the visual representation of information by the use of graphs, charts, maps and other graphical diagrams. It is an imagery representation of data, making it easily understandable and gain insights from it. How IMPORTANT is it? Data visualization may often be depicted as just “pretty pictures” because for most people, it is easier to look at the pictures than to actually read the content/data and extract information from it.

Indexing for Text Documents

Introduction As data increasing exponentially day by day by using personal computer(PC),smart phones,Global Positioning System (GPS),sensors,radio frequency identification (RFID),monitoring devices or mainly due to Social network or application(video streaming mainly) According to Zhou , in the end of 2015 7.92 zetta bytes and predicted at the time of 2020 number will increase to 35 zetta bytes data[1]. Therefore, we get inefficient result when we search for result and query or retrieval information.