Satpute, Ankit Suresh (2021) Feature Based Representative Workload Construction from Traces. Master's, BU Weimar.
Full text not available from this repository.
Abstract
Storage system traces are rich in information as it contains real-world behavior. Replaying already recorded traces is used to reproduce the realistic behavior of systems as accurately as possible. The growing popularity of object storage systems and less focus on creating precise trace replay workload leads this work to focus on identifying components core to currently existing object stores. In this work, a novel features-based method for constructing representative traces from original traces is proposed, where the former serves as an approximation with information from later. Time-related patterns that occurred in the original trace are kept in the representative trace to exhibit the trigger time of operations and application profile. Users can configure the proposed pipeline to evaluate object storage systems and any other storage or file system that uses trace replay workload. A quantifying metric is proposed, which helps analyze the created workload accurately. This also assists in investigating the coverage of features in the representative workload from the original workload. This work tries to solve a computationally complex problem with the help of a greedy algorithm that uses submodularity and guarantees the representation of distinct attributes in the final set of operations. Using the comparison of different configuration types and varying settings of parameters, we show that the method proposed can be easily extended to other applications or for benchmarking of any other system. This work facilitates understanding traces and contributes to improving links to the factors of the system under evaluation.
Item URL in elib: | https://elib.dlr.de/148320/ | ||||||||
---|---|---|---|---|---|---|---|---|---|
Document Type: | Thesis (Master's) | ||||||||
Title: | Feature Based Representative Workload Construction from Traces | ||||||||
Authors: |
| ||||||||
Date: | 20 December 2021 | ||||||||
Refereed publication: | No | ||||||||
Open Access: | No | ||||||||
Number of Pages: | 65 | ||||||||
Status: | Unpublished | ||||||||
Keywords: | data storage systems, trace replays, benchmarking | ||||||||
Institution: | BU Weimar | ||||||||
Department: | Fakultät Medien | ||||||||
HGF - Research field: | Aeronautics, Space and Transport | ||||||||
HGF - Program: | Space | ||||||||
HGF - Program Themes: | Space System Technology | ||||||||
DLR - Research area: | Raumfahrt | ||||||||
DLR - Program: | R SY - Space System Technology | ||||||||
DLR - Research theme (Project): | R - New Data Management Techniques for Earth Observation | ||||||||
Location: | Jena | ||||||||
Institutes and Institutions: | Institute of Data Science > Datamangagement and Analysis | ||||||||
Deposited By: | Paradies, Dr.-Ing. Marcus | ||||||||
Deposited On: | 17 Jan 2022 09:22 | ||||||||
Last Modified: | 17 Jan 2022 09:22 |
Repository Staff Only: item control page