applsci-logo

Journal Browser

Journal Browser

Advances in Distributed and Parallel Big Data Processing

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: closed (30 January 2024) | Viewed by 1224

Special Issue Editor


E-Mail Website
Guest Editor
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei 10672, Taiwan
Interests: deep learning and big data; biometric recognition; information security; cloud and fault computing; multimedia applications; medical applications
Special Issues, Collections and Topics in MDPI journals

Special Issue Information

Dear Colleagues,

Big data refers to a large-scale and complex collection of data that span various fields such as business, scientific research, healthcare, and finance, among others. Through big data analysis, hidden patterns and trends can be discovered, providing insights into market trends, user behavior, disease forecasting, and more.

To handle the scale and complexity of big data, parallel or distributed techniques are required. Traditional serial processing methods are inefficient for processing large datasets; thus, parallel and distributed computing have become crucial. Parallel computing involves dividing tasks into multiple sub-tasks and executing them simultaneously on multiple processing units. Distributed computing involves distributing data across multiple computing nodes, with each node responsible for processing a portion of the data, and communication and coordination occur over a network. These techniques leverage the computational and storage capabilities of multiple computers to enhance data processing speed and efficiency.

Big data require the adoption of parallel or distributed techniques to effectively handle their scale and complexity. These techniques enable the efficient processing of big data. The application prospects of big data are vast and will continue to have a significant impact across various domains.

We welcome new and unpublished papers on, but not limited to, the following topics:

  • Mass data stream processing in the cloud;
  • Big data models and computation theory;
  • Big data mining and fusion;
  • Dimension reduction for large data sets;
  • Big data placement, scheduling and optimization;
  • Multi-source data processing and integration;
  • Deep learning models;
  • Deep learning applications;
  • Cloud security;
  • Data privacy protection;
  • Access control;
  • Data provenance.

Prof. Dr. Shi-Jinn Horng
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • distributed computing
  • parallel computing
  • big data
  • data mining
  • deep learning

Benefits of Publishing in a Special Issue

  • Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
  • Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
  • Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
  • External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
  • e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.

Further information on MDPI's Special Issue polices can be found here.

Published Papers (1 paper)

Order results
Result details
Select all
Export citation of selected articles as:

Research

17 pages, 12717 KiB  
Article
SPinDP: A High-Speed Distributed Processing Platform for Sampling and Filtering Data Streams
by Myeong-Seon Gil and Yang-Sae Moon
Appl. Sci. 2023, 13(24), 12998; https://doi.org/10.3390/app132412998 - 5 Dec 2023
Cited by 1 | Viewed by 847
Abstract
Recently, there has been an explosive generation of streaming data in various fields such as IoT and network attack detection, medical data monitoring, and financial trend analysis. These domains require precise and rapid analysis capabilities by minimizing noise from continuously generated raw data. [...] Read more.
Recently, there has been an explosive generation of streaming data in various fields such as IoT and network attack detection, medical data monitoring, and financial trend analysis. These domains require precise and rapid analysis capabilities by minimizing noise from continuously generated raw data. In this paper, we propose SPinDP (Stream Purifier in Distributed Platform), an open source-based high-speed stream purification platform, to support real-time stream purification. SPinDP consists of four major components, Data Stream Processing Engine, Purification Library, Plan Manager, and Shared Storage, and operates based on open-source systems including Apache Storm and Apache Kafka. In these components, stream processing throughput and latency are critical performance metrics, and SPinDP significantly enhances distributed processing performance by utilizing the ultra-high-speed network RDMA (Remote Direct Memory Access). For the performance evaluation, we use a distributed cluster environment consisting of nine nodes, and we show that SPinDP’s stream processing throughput is more than 28 times higher than that of the existing Ethernet environment. SPinDP also significantly reduces the processing latency by more than 2473 times on average. These results indicate that the proposed SPinDP is an excellent integrated platform that can efficiently purify high-speed and large-scale streams through RDMA-based distributed processing. Full article
(This article belongs to the Special Issue Advances in Distributed and Parallel Big Data Processing)
Show Figures

Figure 1

Back to TopTop