Cloud-Based Information Retrieval for Big Data: A Survey of Architectures and Scalability Challenge

Main Article Content

Tanay Chowdhury

Abstract

Cloud computing has become a paradigm of managing, storing and retrieving large amounts of data emanating in contemporary digital applications. The mode of information retrieval (IR), which is typically insufficient in large-scale, heterogeneous, and dynamic data settings, has been severely challenged by the issue of big data, namely its high volume, high velocity, high diversity, high veracity, and high value. Cloud retrieval information systems take advantage of the elasticity, scalability and on-demand provisioning of cloud systems to facilitate effective and cost-effective access to data across distributed platforms. This work is a critical overview of the concept of big data and cloud-based IR, with a specific emphasis on the most significant models of cloud service, the specifics of data types, and the prospects of ML and DL to improve the quality of retrieval and relevance. Moreover, the paper logically examines key scalability issues, such as distributed storage management, index maintenance, query processing latency, load balancing and resource provisioning. All critical issues related to security and privacy, including leakage of data, insider threats, and vulnerability of programming interfaces, and multi-tenancy risks are also discussed. This paper, by summarizing the available literature and discovering gaps in the research, offers useful information on how scalable, secure, and intelligent information retrieval systems can be designed, as well as presents future research opportunities so as to facilitate reliable deployment of the system in data-intensive applications.

Article Details

How to Cite
Chowdhury, T. (2026). Cloud-Based Information Retrieval for Big Data: A Survey of Architectures and Scalability Challenge. The Eastasouth Journal of Information System and Computer Science, 3(03), 312–322. https://doi.org/10.58812/esiscs.v3i03.937
Section
Articles

References

[1] S. Balasubramaniam and V. Kavitha, “A Survey on Data Retrieval Techniques in Cloud Computing,” vol. 8, no. November, pp. 15–24, 2013.

[2] V. M. L. G. Nerella, “Architecting secure, automated multi-cloud database platforms strategies for scalable compliance,” Int. J. Intell. Syst. Appl. Eng., vol. 9, no. 1, pp. 128–138, 2021.

[3] I. Haneef, E. U. Munir, G. Qaiser, H. Gulfam, and A. Umar, “Big Data Retrieval : Taxonomy , Techniques and Feature Analysis Techniques for Big Data,” Int. J. Comput. Sci. Netw. Secur., vol. 18, no. 11, 2018.

[4] G. Sarraf and V. Pal, “Privacy-Preserving Data Processing in Cloud : From Homomorphic Encryption to Federated Analytics,” Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., vol. 8, no. 2, pp. 735–706, 2022.

[5] S. B. V. Naga, K. C. Sunkara, S. Thangavel, and R. Sundaram, “Secure and Scalable Data Replication Strategies in Distributed Storage Networks,” Int. J. AI, BigData, Comput. Manag. Stud., vol. 2, no. 2, pp. 18–27, 2021, doi: 10.63282/3050-9416.IJAIBDCMS-V2I2P103.

[6] P. Karthikeyan, J. Amudhavel, A. Abraham, D. Sathian, R. S. Raghav, and P. Dhavachelvan, “A Comprehensive Survey on Variants And Its Extensions Of Big Data In Cloud Environment,” in Proceedings of the 2015 International Conference on Advanced Research in Computer Science Engineering & Technology (ICARCSET 2015), Mar. 2015, vol. 06-07-Marc, no. January, pp. 1–5. doi: 10.1145/2743065.2743097.

[7] J. Wang, Y. Yang, T. Wang, R. Simon Sherratt, and J. Zhang, “Big data service architecture: A survey,” J. Internet Technol., vol. 21, no. 2, pp. 393–405, 2020, doi: 10.3966/160792642020032102008.

[8] S. A. El-Seoud, H. F. El-Sofany, M. Abdelfattah, and R. Mohamed, “Big data and cloud computing: Trends and challenges,” Int. J. Interact. Mob. Technol., 2017, doi: 10.3991/ijim.v11i2.6561.

[9] S. Garg, “Predictive Analytics and Auto Remediation using Artificial Inteligence and Machine learning in Cloud Computing Operations,” Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 7, no. 2, 2019.

[10] M. Muniswamaiah, T. Agerwala, and C. Tappert, “Big Data in Cloud Computing Review and Opportunities,” Int. J. Comput. Sci. Inf. Technol., 2019, doi: 10.5121/ijcsit.2019.11404.

[11] M. D. Assunção, R. N. Calheiros, S. Bianchi, M. A. S. Netto, and R. Buyya, “Big Data computing and clouds: Trends and future directions,” J. Parallel Distrib. Comput., 2015, doi: 10.1016/j.jpdc.2014.08.003.

[12] A. K. Sandhu, “Big data with cloud computing: Discussions and challenges,” Big Data Min. Anal., vol. 5, no. 1, pp. 32–40, 2021.

[13] A. Alexandrescu, “Optimization and Security in Information Retrieval, Extraction, Processing, and Presentation on a Cloud Platform,” Information, vol. 10, no. 6, p. 200, Jun. 2019, doi: 10.3390/info10060200.

[14] A. Kushwaha, P. Pathak, and S. Gupta, “Review of optimize load balancing algorithms in cloud,” Int. J. Distrib. Cloud Comput., vol. 4, no. 2, pp. 1–9, 2016.

[15] A. V N Reddy, A. A. Kumar, N. Venu, and R. V. K. Reddy, “On optimization efficiency of scalability and availability of cloud-based software services using scale rate limiting algorithm,” Meas. Sensors, vol. 24, no. January, p. 100468, Dec. 2022, doi: 10.1016/j.measen.2022.100468.

[16] Y. S. Abdulsalam and M. Hedabou, “Security and Privacy in Cloud Computing: Technical Review,” Futur. Internet, vol. 14, no. 1, Dec. 2021, doi: 10.3390/fi14010011.

[17] S. V. Manikanta and K. Varaprasad, “A Secure Privacy Preserving Information Retrieval Model in Cloud Computing,” Int. J. Comput. Organ. Trends ( IJCOT ), vol. 9, no. 1, pp. 16–19, 2019.

[18] F. Wang, H. Wang, and L. Xue, “Research on Data Security in Big Data Cloud Computing Environment,” in 2021 IEEE 5th Advanced Information Technology, Electronic and Automation Control Conference (IAEAC), 2021, pp. 1446–1450. doi: 10.1109/IAEAC50856.2021.9391025.

[19] H. Kimm and J. Ortiz, “Multilevel Security Embedded Information Retrieval and Tracking on Cloud Environments,” in 2021 IEEE Cloud Summit (Cloud Summit), Oct. 2021, pp. 25–28. doi: 10.1109/IEEECloudSummit52029.2021.00012.

[20] Z. Tang, “A Preliminary Study on Data Security Technology in Big Data Cloud Computing Environment,” 2020. doi: 10.1109/ICBASE51474.2020.00013.

[21] Z. Li, J. Xu, Y. Zhao, W. Li, and W. Nie, “MPAN: Multi-Part Attention Network for Point Cloud Based 3D Shape Retrieval,” IEEE Access, vol. 8, pp. 157322–157332, 2020, doi: 10.1109/ACCESS.2020.3018696.

[22] P. Gao, Z. Han, and F. Wan, “Big Data Processing and Application Research,” in 2020 2nd International Conference on Artificial Intelligence and Advanced Manufacture (AIAM), Oct. 2020, pp. 125–128. doi: 10.1109/AIAM50918.2020.00031.

[23] N. N. Das, M. Chowdhary, R. Luthra, Maisera, and S. Garg, “Semantic Big Data Searching In Cloud Storage,” in 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Feb. 2019, pp. 351–355. doi: 10.1109/COMITCon.2019.8862188.

[24] Y. B. Reddy, “Big Data Security in Cloud Environment,” 2018. doi: 10.1109/BDS/HPSC/IDS18.2018.00033.