Data Computing Systems Lab

Publications

Journals
Yoonseok Kang and Dongchul Park, “Optimizing Indexing and Search Performance of Elasticsearch for Large-Scale Log Data (Elasticsearch 기반 대규모 로그 데이터의 인덱싱 및 검색 성능 최적화),” Journal of Platform Technology, March 10, 2025 (accepted). (KCI)
Jinyoung Shin and Dongchul Park, “Performance Optimization of Apache Spark on High-Performance Server for Large-Scale Data Processing (고성능 서버 기반 아파치 스파크의 대규모 데이터 처리 성능 최적화),” Journal of Platform Technology, February 23, 2025 (under review). (KCI)
Yeongmo Lee and Dongchul Park, “Exploring Impacts and Potentials of Unlocked I/O on Single Board Computer Clusters,” Journal of Big Data, November 7, 2024 (under review). (JCR IF top 5%)
Sooyoung Lim and Dongchul Park, “Automatic Reconfiguring the Node-Level Parallelism of YARN in Heterogeneous Low-Power Clusters,” Journal of Big Data, September 18, 2024 (under review). (JCR IF top 5%)
Hyerim Lee and Dongchul Park, “Multigrain: Adaptive Multilevel Hot Data Identifier with a Stack Distance-based Prefilter,” Future Generation Computer Systems, February 7, 2025 (accepted). (JCR IF top 9%)
Sooyoung Lim and Dongchul Park, “Improving Hadoop MapReduce performance on heterogeneous single board computer clusters,” Future Generation Computer Systems, Vol. 160, No. C, pp. 752-766, November 1, 2024. (JCR IF top 9%)
Nayeon Keum and Dongchul Park, “Real-Time Indexing Performance Optimization of Search Platform Based on Big Data Cluster (빅데이터 클러스터 기반 검색 플랫폼의 실시간 인덱싱 성능 최적화),” Journal of Platform Technology, Vol. 11, No. 6, pp. 89-105, December 31, 2023. (KCI)
Eunseo Lee and Dongchul Park, “Performance Analysis of Real-Time Big Data Search Platform Based on High-Capacity Persistent Memory (대용량 영구 메모리 기반 실시간 빅데이터 검색 플랫폼 성능 분석),” Journal of Platform Technology, Vol. 11, No. 4, pp. 50-61, August 31, 2023. (KCI)
Sooyoung Lim and Dongchul Park, “Efficient Stack Distance Approximation Based on Workload Characteristics.” IEEE Access, Vol. 10, pp. 59792-59805, June 6, 2022.
Eunseo Lee, Hyunju Oh and Dongchul Park, “Big Data Processing on Single Board Computer Clusters: Exploring Challenges and Possibilities.” IEEE Access, Vol. 9, pp. 142551-142565, October 15, 2021.
Hyeonji Ha, Daeun Shim, Hyeyin Lee and Dongchul Park, “Dynamic Hot Data Identification Using a Stack Distance Approximation.” IEEE Access, Vol. 9, pp. 79889-79903, May 28, 2021.
Yoonjee Kim and Dongchul Park, “Multiple Bloom Filter and Multiple Hash-based Hot Data Classification for Flash Memory Storage.” Communications of the Korean Institute of Information Scientists and Engineers (Communications of KIISE), Vol. 37, No. 6, pp. 24-34, June 1, 2019.
Ziqi Fan, Dongchul Park. “Extending SSD Lifespan with Cooperative Non-Volatile Memory-based Write Buffers.” Journal of Computer Science and Technology (JCST), Vol.34, No.1, pp.113-132, January 18, 2019.
Dongchul Park, Weiping He, David H.C. Du. “Hot Data Identification with Multiple Bloom Filters: Block-level Decision vs. I/O Request-level Decision.” Journal of Computer Science and Technology (JCST), Vol.33, No.1, pp.79-97, January 26, 2018.
Dongchul Park, Ziqi Fan, Young Jin Nam, and David H.C. Du. “A Lookahead Read Cache: Improving Read Performance for Deduplication Backup Storage.” Journal of Computer Science and Technology (JCST), Vol.32, No.1, pp.26-40, January 11, 2017.
Jianguo Wang, Dongchul Park, Yannis Papakonstantinou, Steven Swanson. “SSD In-Storage Computing for Search Engines.” IEEE Transactions on Computers (TC), DOI: 10.1109/TC.2016.2608818, September 13, 2016.
Dongchul Park, Jianguo Wang, Yang-Suk Kee. “In-Storage Computing for Hadoop MapReduce Framework: Challenges and Possibilities.” IEEE Transactions on Computers (TC), DOI: 10.1109/TC.2016.2595566, July 28, 2016.
Dongchul Park, Biplob Debnath, David H.C. Du. “A Dynamic Switching Flash Translation Layer based on A Page-Level Mapping.” IEICE Transactions on Information and Systems, Vol.E99-D, No.6, pp.1502-1511, June 1, 2016.
Dongchul Park, Biplob Debnath, Young Jin Nam, David H.C. Du, Youngkyun Kim and Youngchul Kim. “An On-line Hot Data Identification for Flash-based Storage using Sampling Mechanism.” ACM SIGAPP Applied Computing Review (ACR), Vol.13, No.1, pp.51-64, March 2013.
Jaewon Oh, Jongwon Lee, Dongchul Park, ByungJeong Lee, and Chisu Wu. “A Metamodel for Creation and Maintenance of Evaluation Set of Software Package Evaluation.” The Korea Information Processing Society (KIPS) Transactions, Part D, Vol.11-D, No.03, pp.577-590, June 2004.
Jaewon Oh, Dongchul Park, Jongwon Lee, ByungJeong Lee, Euyseok Hong, and Chisu Wu. “Certification of software packages using hierarchical classification.” Springer Lecture Notes in Computer Science (LNCS), Vol.3026, pp.209-224, April 2004.
Conferences / Workshops
Hanyeoreum Bae, Miryeong Kwon, Donghyun Gouk, Sanghyun Han, Sungjoon Koh, Changrim Lee, Dongchul Park, Myoungsoo Jung, "Slow is Fast: Rethinking In-Memory Graph Analysis with Persistent Memory," The 13th Annual Non-Volatile Memory Workshop (NVMW'22), San Diego, CA, USA, May 9-10, 2022.
Hyerim Lee, Yibin Yun, and Dongchul Park, "Hot Data Identification based on Naive Bayes Classifier (나이브 베이즈 분류 기반의 핫 데이터 구분 기법)," Annual Conference of KIPS, Chuncheon, Korea, November 3-5, 2022.
Hanyeoreum Bae, Miryeong Kwon, Donghyun Gouk, Sanghyun Han, Sungjoon Koh, Changrim Lee, Dongchul Park and Myoungsoo Jung, “Empirical Guide to Use of Persistent Memory for Large-Scale In-Memory Graph Analysis.” In Proceedings of the IEEE International Conference on Computer Design (ICCD’21), Virtual Conference, October 24-27, 2021.
Manas Minglani, Jim Diehl, Xiang Cao, Bingzhe Li, Dongchul Park, David J. Lilja, David H.C. Du. “Kinetic Action: Performance Analysis of Integrated Key-Value Storage Devices vs. LevelDB Servers.” In Proceedings of the 23rd IEEE International Conference on Parallel and Distributed Systems (ICPADS’17), pp.501-510, Shenzhen, China, December 15-17, 2017.
Ziqi Fan, Fenggang Wu, Dongchul Park, Jim Diehl, Doug Voigt and David H.C. Du, “Hibachi: A Cooperative Hybrid Cache with NVRAM and DRAM for Storage Arrays,” In Proceedings of the 33rd IEEE Symposium on Mass Storage Systems and Technologies (MSST’17), pp.1-11, Santa Clara, CA, USA, May 15-19, 2017.
Jianguo Wang, Dongchul Park, Yang-Suk Kee, Yannis Papakonstantinou and Steven Swanson. “SSD In-Storage Computing for List Intersection.” In Proceedings of the 12th International Workshop on Data Management on New Hardware (DaMoN’16), in conjunction with SIGMOD'16, San Francisco, CA, USA, June 27, 2016.
Chung-I Lin, Dongchul Park, Weiping He and David H.C. Du. “H-SWD: Incorporating Hot Data Identification into Shingled Write Disks.” In Proceedings of the 20th IEEE International Symposium on Modeling, Analysis and Simulations of Computer and Telecommunication Systems (MASCOTS '12), pp.321-330, Arlington, VA, USA, August 7-9, 2012.
Young Jin Nam, Dongchul Park, and David H.C. Du. “Assuring Demanded Read Performance of Data Deduplication Storage with Backup Datasets.” In Proceedings of the 20th IEEE International Symposium on Modeling, Analysis and Simulations of Computer and Telecommunication Systems (MASCOTS'12), pp.201-208, Arlington, VA, USA, August 7-9, 2012.
Dongchul Park, Biplob Debnath, Young Jin Nam, David H.C. Du, Youngkyun Kim, and Youngchul Kim. “HotDataTrap: A Sampling-based Hot Data Identification Scheme for Flash Memory.” In Proceedings of the 27th ACM Symposium on Applied Computing (SAC '12), pp.1610-1617, Italy, March 26-30, 2012.
Young Jin Nam, Dongchul Park and David H.C. Du. “Virtual USB Drive: A Key Component for Smart Home Storage Architecture.” In Proceedings of the 30th IEEE International Conference on Consumer Electronics (ICCE '12), Las Vegas, USA, January 13-16, 2012.
Dongchul Park, Biplob Debnath, and David H.C. Du, “A Workload-Aware Adaptive Hybrid Flash Translation Layer with an Efficient Caching Strategy,” In Proceedings of the 19th IEEE International Symposium on Modeling, Analysis and Simulations of Computer and Telecommunication Systems (MASCOTS’11), pp.248-255, Singapore, July 25-27, 2011.
Dongchul Park and David H.C. Du, “Hot Data Identification for Flash-based Storage Systems using Multiple Bloom Filters,” In Proceedings of the 27th IEEE Symposium on Mass Storage Systems and Technologies (MSST’11), pp.1-11, Denver, CO, USA, May 23-27, 2011.
Dongchul Park, Biplob Debnath, and David H.C. Du, “CFTL: A Convertible Flash Translation Layer Adaptive to Data Access Patterns,” In Proceedings of ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’10), pp.365-366, New York, NY, USA, June 14-18, 2010 (short paper).
Dongchul Park, “Translation of Safety-Critical Software Requirements Specification to Lustre,” In Proceedings of International Joint Conferences on Computer, Information, and System Sciences, and Engineering (CISSE’06), pp.157-162, December 5-14, 2006.
Dongchul Park, Jaewon Oh, Jongwon Lee, and Chisu Wu. “Quality certification based on hierarchical classification of software packages.” In Proceedings of the 7th IEEE Korea-Russia International Symposium on Science and Technology (KORUS’03), pp.148-154, Ulsan, Korea, June 27-July 1, 2003.
Jaewon Oh, ByungJeong Lee, Dongchul Park, Jongwon Lee, Euyseok Hong, and Chisu Wu. “Using Hierarchical Classification to Certify Software Packages.” In Proceedings of the ACIS International Conference on Software Engineering Research, Management and Application (SERA’03), pp.270-275, San Francisco, CA, USA, June 25-27, 2003.
Patents
Dongchul Park and Yeongmo Lee. “A prompt injection protection mechanism using the risk scoring technique (위험점수 라벨링 기반 프롬프트 인젝션 공격 방어 기법),” Korea Patent, CAU20250151KR, April 12, 2025 (Application submitted).
Dongchul Park and Yeongmo Lee. “Adaptive Analysis and Risk-Based Filtering for Prompt Injection Defense (문장유사도 비교를 기반으로한 프롬프트 인젝션 방어 기법),” Korea Patent, CAU20250140KR, April 4, 2025 (Application submitted).
Dongchul Park and Dayeon Jung. “Method for providing a homomorphic encryption personal information protection service based on homomorphic encryption, and apparatus thereof (동형암호를 기반으로 한 동형 암호 개인정보 보호 서비스 제공 방법 및 그 장치),” Korea Patent, 10-2025-0020266, February 18, 2025 (Patent pending).
Dongchul Park and Sooyoung Lim. “Method and device for allocating Mapreduce task in heterogeneous cluster environment (이기종 클러스터 환경에서 맵리듀스 작업 할당 방법 및 장치),” Korea Patent, 10-2025-0008411, January 21, 2025 (Patent pending).
Dongchul Park and Hyerim Lee. “A multi-layer hot-data classification method with stack distance-based pre-filtering mechanism, and apparatus thereof (스택 거리 기반 사전 필터링 메커니즘을 가진 다층 핫데이터 구분 방법 및 그 장치),” Korea Patent, 10-2024-0193958, December 23, 2024 (Patent pending).
Dongchul Park and Yeongmo Lee. “Method for utilizing question type classification using separated LLM models and a vector database, and apparatus thereof (분리된 LLM 모델 및 벡터 데이터베이스를 활용한 질문 유형 분류 활용 방법 및 그 장치),” Korea Patent, 10-2024-0178116, December 4, 2024 (Patent pending).
Hangbae Chang, Dongchul Park, Jawon Kim. “SBoM creation system for safe technology development in digital financial environment (디지털 금융환경의 안전한 기술개발을 위한 SBoM 생성 시스템),” Korea Patent, 10-2023-0193411, December 27, 2023 (Patent pending).
Dongchul Park, Hyeonji Ha, Daeun Shim and Hyeyin Lee. “Electronic device for efficiently estimating stack distance and operation method of the same (스택거리를 효율적으로 추정하는 전자장치 및 이의 동작 방법),” Korea Patent, 1024913520000, January 18, 2023 (Registered).
Dongchul Park and Yang Seok Ki. “Computing System with Processing and Method of Operation Thereof.” US Patent, 10,198,185, February 5, 2019 (with Samsung) (Registered).
Dongchul Park, Hyeonji Ha, Daeun Shim and Hyeyin Lee. “Electronic device for efficiently estimating stack distance and operation method of the same,” Korea Patent, 10-2021-0047747, April 13, 2021.
Dongchul Park, Daeun Shim, Hyeonji Ha and Hyeyin Lee. “Data Storage device dynamically determining whether data is hot or cold and operation method of the same,” Korea Patent, 10-2021-0044225, April 5, 2021.
Dongchul Park. “System for managing ransomware test using virtual machine technologies and method therefor,” Korea Patent, 10-2019-0021247, February 22, 2019.
Sanjeev N. Trika, Dongchul Park, Peng Li, Francis R. Corrado, Robert A. Dickinson. “A Data Management System Employing A Hash-Based and Tree-Based Key-Value Data Structure.” US Patent, 20190034427, January 31, 2019 (with Intel).
Yangwook Kang, Yang Seok Ki, and Dongchul Park. “Computing System with Distributed Compute-Enabled Storage Group and Method of Operation Thereof.” US Patent, 20160191665, June 30, 2016 (with Samsung).
Technical Presentations (WiPs / Posters)
Dongchul Park, Youngjin Nam, Biplob Debnath and David H.C. Du. HotDataTrap: A Hot Data Identification Scheme with Sampling Mechanism for Flash Memory. 3rd Annual Non-Volatile Memories Workshop (NVMW 2012), San Diego, CA, USA, March 4-6, 2012.
Dongchul Park, Chung-I Lin and David H.C. Du. H-SWD: A Novel Shingled Write Disk Scheme based on Hot and Cold Data Identification. Work-in-Progress (WiPs) presentation, 10th USENIX Conference on File and Storage Technologies (FAST '12), San Jose, CA, USA, February 14-17, 2012. (25% acceptance rate)
Dongchul Park, Biplob Debnath and David H.C. Du. An Adaptive Hybrid Flash Translation Layer with Efficient Caching Strategies. 2nd Annual Non-Volatile Memories Workshop (NVMW 2011), San Diego, CA, USA, March 6-8, 2011.
Dongchul Park and David H.C. Du. Hot Data Identification for Flash Memory using Multiple Bloom Filters. 9th USENIX Conference on File and Storage Technologies (FAST '11), San Jose, CA, USA, February 15-17, 2011.
Dongchul Park, Anna Kryzhnyaya, David H.C. Du and Cory Devor. Storage Architectures for Ultra-Scale Digital Media Archives. A National Science Foundation (NSF) Industry/University Cooperative Research Center (I/U CRC), Center for Research in Intelligent Storage (CRIS) Workshop, Minneapolis, MN, USA, August 11-12, 2010.
Dongchul Park, Anna Kryzhnyaya, David H.C. Du and Cory Devor. Storage Architecture for Extreme Scale Media Archives. A National Science Foundation (NSF) Industry/University Cooperative Research Center (I/U CRC), Center for Research in Intelligent Storage (CRIS) Workshop, Minneapolis, MN, USA, January 20-21, 2010.
Technical / Other Reports
Dongchul Park, "Enterprise search performance optimization for massive log data," Research report of Sookmyung Women's University and WINS Co., Ltd. joint project, January 2022.
Dongchul Park, "Efficient management and analysis of large log data using the state-of-the-art big data technologies," Research report of Sookmyung Women's University and WINS Co., Ltd. joint project, January 2021.
Dongchul Park, Biplob Debnath, Youngjin Nam, David H.C. Du, Youngkyun Kim and Youngchul Kim. HotDataTrap: A Sampling-based Hot Data Identification Scheme for Flash Memory. Computer Science and Engineering Technical Report, TR 11-009, University of Minnesota, May 2011.
Dongchul Park and David H.C. Du. Hot Data Identification for Flash Memory using Multiple Bloom Filters. Computer Science and Engineering Technical Report, TR 10-026, University of Minnesota, October 2010.
Dongchul Park, Biplob Debnath and David H.C. Du. CFTL: A Convertible Flash Translation Layer with Consideration of Data Access Patterns. Computer Science and Engineering Technical Report, TR 09-023, University of Minnesota, September 2009.
Dongchul Park, Jaehoon Jeong, David H.C. Du, Hongyeon Kim, Youngkyun Kim and Youngchul Kim. Research on Intelligent Data Management for Large-Scale Distributed File Systems in Multiple Data Centers Environment. Report of University of Minnesota and ETRI Joint Project, January 2009.
Dongchul Park, Jaehoon Jeong, David H.C. Du, Youngkyun Kim and Youngchul Kim. Research on Bulk Data Transfer & Backup Techniques for Large-Scale Distributed File Systems. Report of University of Minnesota and ETRI (Electronics and Telecommunications Research Institute, Korea) Joint Project, January 2008.