AI Voice Assistant and Caption Generation Using Convolution Neural Network and Bi LSTM

  • G. Rajasekaran Dhaanish Ahmed College of Engineering, Chennai, Tamil Nadu, India.
  • L.J. Dency Snowvin Department of Computer Science and Engineering, Dhaanish Ahmed College of Engineering, Padappai, Chennai, Tamil Nadu, India.
  • K. Vaishnavi Department of Computer Science and Engineering, Dhaanish Ahmed College of Engineering, Padappai, Chennai, Tamil Nadu, India.
  • D. Angelina Mary Department of Computer Science and Engineering, Dhaanish Ahmed College of Engineering, Padappai, Chennai, Tamil Nadu, India.
  • S. Suman Rajest Dhaanish Ahmed College of Engineering, Chennai, Tamil Nadu, India.
  • M. Mohamed Sameer Ali Dhaanish Ahmed College of Engineering, Chennai, Tamil Nadu, India.
Keywords: Technological Innovation, Educational Technology, Psychology, Physiology, Technology, Remote Work and Virtual Learning

Abstract

This research examines the physiological reactions to stress in light of the heightened stress levels observed among workers and students during the COVID-19 pandemic, particularly in distant work and virtual learning contexts. The study investigates several factors that affect stress levels in distant workers and students since it is important to understand these responses. By using ideas from psychology, physiology, and technology, the research finds the main causes of increased stress in different groups of people. The suggested approach has important effects on occupational health because it gives remote workers access to tools and information that can help them make their work environments healthier. In the same way, the system improves student well-being in virtual learning environments by giving them important support during the difficulties of remote learning. Additionally, this type of technology is useful in other areas, such as telemedicine, and it is helping to create technology-based solutions for managing stress and improving health in general. We want to help people deal with the stress of working from home and learning online by using new technologies and greater understanding of psychology. In the end, we want to help people become more resilient and healthy in the face of new obstacles.

References

G. Kulkarni, V. Premraj, S. Dhar, et al., “Baby talk: Understanding and generating simple image descriptions,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, IEEE Computer Society, United States of America, 2011.

A. F. Biten, L. Gomez, and D. Karatzas, “Let there be a clock on the beach: Reducing object hallucination in image captioning,” in Proc. IEEE/CVF Winter Conf. Applications of Computer Vision, Florida, United States of America, 2022.

L. A. Hendricks, K. Burns, K. Saenko, T. Darrell, and A. Rohrbach, “Women also snowboard: Overcoming bias in captioning models,” in Proc. European Conf. Computer Vision (ECCV). Munich, Germany, 2018.

T. Yao, Y. Pan, Y. Li, and T. Mei, “Exploring visual relationship for image captioning,” in Proc. European Conf. Computer Vision (ECCV), Munich, Germany, 2018.

S. Herdade, A. Kappeler, K. Boakye, and J. Soares, “Image captioning: Transforming objects into words,” in Advances in Neural Information Processing Systems, Vancouver, Canada, 2019.

S. A. Karthik, S. B. Naga, G. Satish, N. Shobha, H. K. Bhargav, and B. M. Chandrakala, “AI and IoT-infused urban connectivity for smart cities,” in Future of Digital Technology and AI in Social Sectors, D. Ertuğrul and A. Elçi, Eds. IGI Global Scientific Publishing, 2025, pp. 367–394.

S. Rashmi, B. M. Chandrakala, D. M. Ramani, and M. S. Harsur, “CNN based multi-view classification and ROI segmentation: A survey,” Global Transitions Proceedings, vol. 3, no. 1, pp. 86–90, 2022.

K. S. N. S. Nischal, N. S. Guvvala, C. Mathew, G. C. S. Gowda, and B. M. Chandrakala, “A survey on recognition of handwritten ZIP codes in a postal sorting system,” International Research Journal of Engineering and Technology (IRJET), vol. 7, no. 3, pp. 1–4, May 2020.

B. M. Chandrakala and S. C. Linga Reddy, “Proxy re-encryption using MLBC (Modified Lattice Based Cryptography),” in Proc. Int. Conf. Recent Advances in Energy-efficient Computing and Communication (ICRAECC), Nagercoil, India, 2019, pp. 1–5.

H. S. Supriya and B. M. Chandrakala, “An efficient multi-layer hybrid neural network and optimized parameter enhancing approach for traffic prediction in Big Data Domain,” The Journal of Special Education, vol. 1, no. 43, pp. 94–96, 2022.

R. Sushmitha, A. K. Gupta, and B. M. Chandrakala, “Automated segmentation technique for detection of myocardial contours in cardiac MRI,” in Proc. Int. Conf. Communication and Electronics Systems (ICCES), Coimbatore, India, 2019, pp. 986–991.

K. Shanthala, B. M. Chandrakala, N. Shobha, and D. D., “Automated diagnosis of brain tumor classification and segmentation of MRI images,” in Proc. Int. Conf. Confluence of Advancements in Robotics, Vision and Interdisciplinary Technology Management (IC-RVITM), Bangalore, India, 2023, pp. 1–7.

B. M. Chandrakala et al., “Harnessing online activism and diversity tech in HR through cloud computing,” in Future of Digital Technology and AI in Social Sectors, D. Ç. Ertuğrul and A. Elçi, Eds. IGI Global Scientific Publishing, 2025, pp. 151–182.

A. Navya and B. M. Chandrakala, “The effective dashboard to control the intrusion in the private protection of the cloudlet based on the medical mutual data using ECC,” in Proc. Int. Conf. Inventive Research in Computing Applications (ICIRCA), Coimbatore, India, 2018, pp. 538–543.

B. M. Chandrakala and S. C. Lingareddy, “Secure and efficient bi-directional proxy re-encryption technique,” in Proc. Int. Conf. Control, Instrumentation, Communication and Computational Technologies (ICCICCT), Kumaracoil, India, 2016, pp. 88–92.

V. Hiremath, “Quantum Networking: Strategic Imperatives for Enterprises and Service Providers in the Emerging Quantum Era,” Journal of Computational Analysis and Applications (JoCAAA), vol. 31, no. 3, pp. 617–631, Dec. 2023.

V. Hiremath, “Real-Time BGP Monitoring with BMP and Streaming Telemetry,” International Journal of Environmental Science, vol. 11, no. 1s, pp. 1109–1115, Mar. 2025.

N. J. Maiti, S. Ganguly, K. Choowongkomon, S. Seetaha, S. Saehlee, and T. Aiebchun, "Synthesis, in vitro Anti-HIV-1RT evaluation, molecular modeling, DFT and acute oral toxicity studies of some benzotriazole derivatives," J. Struct. Biol., vol. 216, no. 2, p. 108094, 2024.

N. J. Maiti and S. Ganguly, "Synthesis, spectral analysis, antimicrobial evaluation, molecular modelling, DFT, TD‐DFT and SAR studies of novel 4,5,6,7‐tetrabromo‐1H‐benzo[d][1,2,3]triazole derivatives," ChemistrySelect, vol. 9, no. 36, p. e202401746, 2024.

D. K. Arora et al., “An in vitro assessment of microleakage of pit and fissure sealants and restorative materials using dye penetration method,” Journal of Pharmacy and Bioallied Sciences, Feb. 2025.

R. Nagar et al., “In vitro analysis of compressive strength of three different aesthetic restorative materials,” Journal of Pharmacy and Bioallied Sciences, Feb. 2025.

N. Maiti et al., “Assessment of the efficacy of photobiomodulation (PBM) therapy in periodontal treatment: a longitudinal study,” Journal of Pharmacy and Bioallied Sciences, vol. 16, no. Suppl 3, pp. S2449–S2451, Jul. 2024.

N. J. Maiti and S. Ganguly, "Some new benzotriazole derivatives: Synthesis, antimycobacterial evaluation, antimicrobial efficacy, ADME studies, and molecular docking studies," Indian Journal of Heterocyclic Chemistry, vol. 33, no. 3, pp. 385–392, 2023.

N. J. Maiti, S. Ganguly, B. Sarkar, and R. Saha, "New benzotriazole derivatives: Synthesis, biological assessment, in vivo oral toxicity analysis, docking studies, molecular dynamics, and ADME profiling," Indian Journal of Heterocyclic Chemistry, vol. 33, no. 4, pp. 489–497, 2023.

N. J. Maiti, "A comprehensive review on analytical techniques for the quantification of pharmaceutical compounds in biological matrices," Journal of Cardiovascular Research, vol. 15, no. 9, 2024.

A. Vahora, R. Patel, B. Goradiya, and A. Desai, ‘Heart beat monitoring and wireless data logging using arm cortex A8’, International Journal on Recent and Innovation Trends in Computing and Communication, vol. 2, no. 8, pp. 2321–2325, 2014.

A. Vahora, B. Goradiya, D. Parikh, and A. Shah, ‘Designing a Model for Traffic Rule Violation at Railway Track Using Raspberry Pi in Indian Context’, International Journal of Latest Technology in Engineering,Management & Applied Science, vol. 6, no. 6, pp. 122–125, 2017.

A. Vahora and K. Pandya, ‘Implementation of cylindrical dielectric resonator antenna array for Wi-Fi/wireless LAN/satellite applications’, Progress in Electromagnetics Research M, vol. 90, pp. 157–166, 2020.

A. Vahora and K. Pandya, ‘Triple Band Dielectric Resonator Antenna Array Using Power Divider Network Technique for GPS Navigation/Bluetooth/Satellite Applications’, International Journal of Microwave and Optical Technology, vol. 15, no. 4, pp. 369–378, 2020.

A. Vahora and K. Pandya, ‘A miniaturized cylindrical dielectric resonator antenna array development for GPS/Wi-Fi/wireless LAN applications’, e-Prime-Advances in Electrical Engineering, Electronics and Energy, vol. 2, p. 100044, 2022.

D. Sumathi and P. Poongodi, "Scheduling Based on Hybrid Particle Swarm Optimization with Cuckoo Search Algorithm in Cloud Environment," IIOAB Journal, vol. 7, no. 9, pp. 358-366, 2016.

D. Sumathi and P. Poongodi, "Secure medical information processing in cloud: Trust with swarm based scheduling," Journal of Medical Imaging and Health Informatics, vol. 6, no. 7, pp. 1636-1640, 2016.

D. Sumathi and P. Poongodi, "An improved scheduling strategy in cloud using trust based mechanism," Int. J. Comput. Electr. Autom. Control Inf. Eng, vol. 9, no. 2, pp. 637-641, 2015.

V. B. Gowda, M. T. Gopalakrishna, J. Megha, and S. Mohankumar, “Foreground segmentation network using transposed convolutional neural networks and up sampling for multiscale feature encoding,” Neural Netw., vol. 170, pp. 167–175, 2024.

V. B. Gowda, G. M. Thimmaiah, M. Jaishankar, and C. Y. Lokkondra, “Background-foreground segmentation using Multi-scale Attention Net (MA-Net): A deep learning approach,” Rev. Intell. Artif., vol. 37, no. 3, pp. 557–565, 2023.

V. B. Gowda, M. G. Krishna, and J. Megha, “Dynamic Background Modeling and Foreground Detection using Orthogonal Projection onto the Subspace of Moving Objects,” in Proc. IC3, 2023, pp. 171–176.

V. B. Gowda, M. T. Gopalakrishna, J. Megha, and S. Mohankumar, “Background initialization in video data using singular value decomposition and robust principal component analysis,” Int. J. Comput. Appl., vol. 45, no. 9, pp. 600–609, 2023.

D. Sumathi, B. Melinamath, and R. Goyal, "Iov Traffic Prediction Utilizing Bidirectional Memory and Spatiotemporal Constraints with Local Search and NonLinear Analysis," Journal of Computational Analysis & Applications, vol. 33, no. 2, 2024.

D. Sumathi, A. Singh, A. Sinha, D. Aditya, and M. R. KF, "The Deepfake Dilemma: Enhancing Deepfake Detection with Vision Transformers," in 2025 International Conference on Intelligent and Innovative Technologies in Computing, Electrical and Electronics, Jan. 2025, pp. 1-7.

A. K. Joshi and S. B. Kulkarni, “Flow analysis of vehicles on a lane using deep learning techniques,” J. Adv. Inf. Technol., vol. 14, no. 6, pp. 1354–1364, 2023.

A. K. Joshi, V. Shirol, S. Jogar, P. Naik, and A. Yaligar, “Credit card fraud detection using machine learning techniques,” Int. J. Sci. Res. Comput. Sci. Eng. Inf. Technol., vol. 6, no. 3, pp. 436–442, 2020.

A. K. Joshi and S. B. Kulkarni, “Multi-modal information fusion for localization of emergency vehicles,” Int. J. Image Graph., vol. 24, no. 1, Art. no. 2550050, 2024.

A. K. Joshi and S. B. Kulkarni, “Multimodal deep learning information fusion for fine-grained traffic state estimation and intelligent traffic control,” Int. J. Intell. Syst. Appl. Eng., vol. 11, no. 3, pp. 1020–1029, 2023.

A. Vahora and K. Pandya, ‘A Low-profile 4-element Circularly Polarized Hexagonal DRA Array for Triple-band Wireless Applications’, Advanced Electromagnetics, vol. 11, no. 4, pp. 90–97, 2022.

A. Vahora and M. Munsuri, ‘Smart Embedded System for Physiological Monitoring Using Machine Learning and Sensor Fusion’, Journal of Neonatal Surgery, vol. 14, no. 19s, pp. 694–703, 2025.

M. Fafolawala, Y. Mehta, and A. Vahora, ‘Agricultural Drones: Transforming Farming Practices with Advanced Technology’, International Journal Of Latest Technology In Engineering,Management & Applied Science, vol. 14, no. 4, pp. 877–882, 2025.

A. Vahora, M. Fafolawala, and Y. Mehta, ‘Federated Learning-Enabled Air Quality Monitoring System for Safe Driving in IoT-Integrated Vehicles’, International Journal of Environmental Sciences, vol. 11, no. 4s, pp. 715–723, 2025.

V. S. A. Anala, A. R. Pothu, and S. Chintapalli, “Enhancing Preventive Healthcare with Wearable Health Technology for Early Intervention,” FMDB Transactions on Sustainable Health Science Letters., vol.2, no.4, pp. 211–220, 2024.

V. S. A. Anala and S. Chintapalli, “Scalable Data Partitioning Strategies for Efficient Query Optimization in Cloud Data Warehouses,” FMDB Transactions on Sustainable Computer Letters., vol. 2, no. 4, pp. 195–206, 2024.

N. Ansari, G. Singh, R. Singh and Sheetal, "Innovative herbal tea formulation using Holarrhena antidysenterica, Emblica officinalis, and Stevia: Nutritional and phytochemical analysis," J. Neonatal Surg., vol. 14, no. 6, pp. 381-389, 2025.

S. Bala, G. Singh and M. Kaur, "Mindfulness of functional foods in cancer prevention and health promotion: A comprehensive review," Rev. Electron. De Vet., vol. 25, no. 1, pp. 1181-1187, 2024.

S. Bala, G. Singh, R. Arora and Devanshika, "Impact of caffeine consumption on stress management and stamina among university students," Rev. Electron. De Vet., vol. 25, no. 2, pp. 253-259, 2024.

G. Singh and M. Khatana, "Assessment regarding the efficacy of intermittent fasting," Int. J. Res. Anal. Rev., vol. 9, no. 1, pp. 2349-5138, 2022.

G. Singh et al., "Analyze the effects of prebiotics on the immunity of human beings through various clinical studies," Jundishapur J. Microbiol., vol. 15, no. 1, pp. 1167-1177, 2022.

G. Singh et al., "Comprehensive look of renal calculi in kidneys: A review," NeuroQuantology, vol. 20, no. 5, pp. 4404-4412, 2022.

Md H. Rahman, T. Islam, Md E. Hossen, Md E. Chowdhury, R. Hayat, Md S. S. Shovon, M. Alamgir, S. Akter, R. Chowdhury, and A. R. Sunny, "Machine Learning in Healthcare: From Diagnostics to Personalized Medicine and Predictive Analytics," J. Angiother., vol. 8, no. 12, pp. 1–8, 2024.

R. Chowdhury, Md A. H. Fahad, S. M. S. Alam, M. I. Tusher, Md N. U. Rana, E. Ahmed, S. S. Akhi, and Md R. H. Mahin, "Database Management in the Era of Big Data: Trends, Challenges, and Breakthroughs," Pathfinder Res., vol. 1, no. 1, p. 15, 2020.

Md R. H. Mahin, E. Ahmed, S. S. Akhi, Md A. H. Fahad, M. I. Tusher, R. Chowdhury, and Md N. U. Rana, "Advancements and Challenges in Software Engineering and Project Management: A 2021 Perspective," Pathfinder Res., vol. 2, no. 1, p. 15, 2021.

Md A. H. Fahad and R. Chowdhury, "Evolution and Future Trends in Web Development: A Comprehensive Review," Pathfinder Res., vol. 3, no. 1, p. 13, 2022.

S. N. Akhter, R. Kumari, and A. Kumar, “Fertility booster effect of Asparagus recemosus against arsenic induced reproductive toxicity in Charles Foster rats,” J. Adv. Zool., vol. 45, no. 5, 2024.

Z. Hashmi, R. Kumari, and A. Kumar, “Antidote effect of Bacopa Koneru against arsenic induced toxicity in rats,” J. Adv. Zool., vol. 45, no. 5, 2024.

Z. Hashmi, R. Kumari, and A. Kumar, “Phytoremedial effect of Ocimum sanctum against arsenic induced toxicity in Charles Foster rats,” J. Adv. Zool., vol. 45, no. 5, 2024.

B. Kumari, P. Das, and R. Kumari, “Accelerated processing of solitary and clustered abasic site DNA damage lesion by APE1 in the presence of aqueous extract of Ganoderma lucidum,” J. Biosci., vol. 41, pp. 265–275, 2016.

R. Kumari, R. K. Singh, N. Kumar, and R. Kumari, “Preparation of superfine Bael leaf nanopowder, physical properties measurements and its antimicrobial activities,” Egypt. Chem. Bull., vol. 12, no. 4, pp. 284–297, 2023.

M. K. Sinha, R. Kumari, and A. Kumar, “Ameliorative effect of Ganoderma lucidum on sodium arsenite induced toxicity in Charles Foster rats,” J. Adv. Zool., vol. 45, no. 5, 2024.

V. Rajavel, "Integrating power-saving techniques into design for testability of semiconductors for power-efficient testing," The American Journal of Engineering and Technology, vol. 7, no. 3, pp. 243–251, 2025.

V. Rajavel, "Novel machine learning approach for defect detection in DFT processes," ASRJETS-Journal, vol. 101, no. 1, pp. 325–334, Apr. 2025. [Online].

V. Rajavel, "Optimizing semiconductor testing: Leveraging stuck-at fault models for efficient fault coverage," Int. J. Latest Eng. Manag. Res. (IJLEMR), vol. 10, no. 2, pp. 69–76, Feb. 2025.

Md S. Miah and Md S. Islam, "Big Data Analytics Architectural Data Cut off Tactics for Cyber Security and Its Implication in Digital forensic," in Proc. 2022 Int. Conf. Futuristic Technol. (INCOFT), Belgaum, India, 2022, pp. 1–6.

M. A. Obaida, Md S. Miah, and Md. A. Horaira, “Random Early Discard (RED-AQM) Performance Analysis in Terms of TCP Variants and Network Parameters: Instability in High-Bandwidth-Delay Network,” Int. J. Comput. Appl., vol. 27, no. 8, pp. 40–44, Aug. 2011.

A. Srivastava, “Use of Python in Data Science, Data Integration and Data Engineer,” Int. J. Sci. Res. Eng. Manag., vol. 8, no. 7, 2024.

A. Srivastava, “AI in Healthcare and its Future,” J. Artif. Intell. Cloud Comput., vol. 1, no. 1, pp. 1–2, Mar. 2022.

A. Srivastava, “Cloud Replacing Traditional Database,” Int. J. Multidiscip. Res., vol. 7, no. 2, pp. 1–2, Mar.–Apr. 2025.

A. Srivastava, “Data Transformation Normalization to Denormalization in Cloud,” Int. J. Core Eng. Manag., vol. 6, no. 7, pp. 249–252, 2020.

A. Srivastava, “Impact of AI/ML on Job Market and Skills Set and Health Industry,” ESP J. Eng. Technol. Adv., vol. 4, no. 3, pp. 122–126, 2024.

G. Kashyap, "Neural Architecture Search (NAS): Exploring the Trade-Offs In Automated Model Design and Its Impact on Deep Learning Performance," Int. J. Innov. Res. Eng. Multidiscip. Phys. Sci., vol. 13, no. 2, pp. 1–12, Mar.–Apr. 2025.

G. Kashyap, "Large Language Models and Their Ethical Implications: The role of models like GPT and BERT in shaping future AI applications and their risks," Int. J. Innov. Res. Creat. Technol., vol. 6, no. 6, pp. 1–5, Dec. 2020.

G. Kashyap, "AI for Epidemiology: Using AI to Predict and Track the Spread of Diseases like COVID-19," Int. J. Innov. Res. Multidiscip. Field, vol. 3, no. 6, pp. 1–10, Nov. 2021.

G. Kashyap, "AI for Threat Detection and Mitigation: Using AI to Identify and Respond to Cybersecurity Threats in Real-Time," Int. J. Sci. Res. Eng. Manag. Sci., vol. 6, no. 6, pp. 1–5, Nov. 2024.

G. Kashyap, "AI for Information Retrieval: Advancements in Search Engines and Chatbots through Deep Learning-Based Query Understanding," Int. J. Innov. Res. Creat. Technol., vol. 7, no. 1, pp. 1–7, Jan. 2021.

G. Kashyap, "Multilingual NLP: Techniques for Creating Models that Understand and Generate Multiple Languages with Minimal Resources," Int. J. Sci. Res. Eng. Manag. Sci., vol. 6, no. 12, pp. 1–5, Dec. 2024.

G. Singh, S. Bala and S. Singh, "Nutraceuticals miraculously alter human genomics: A review on nutrigenomics," Afr. J. Biol. Sci., vol. 6, no. 6, pp. 599-699, 2024.

G. Singh, S. Bala, M. Kaur and S. Phagna, "Exploring public awareness and attitudes towards dietary supplements," Afr. J. Biol. Sci., vol. 6, no. 6, pp. 7288-7299, 2024.

G. Singh, J. Bharti and C. Dua, "Mindfulness of nutritional knowledge and food hygiene practices on the health among young adults," Afr. J. Biol. Sci., vol. 6, no. 6, pp. 5813-5818, 2024.

G. Singh and G. K. Kochar, "Zinc content of commonly consumed foods of Kurukshetra district of Haryana," Food Sci. Res. J., vol. 1, no. 2, pp. 94-98, 2010.

G. Singh et al., "Liver cirrhosis: The struggling liver," Int. J. Health Sci., vol. 6, no. 1, pp. 5547-5559, 2022.

Published
2025-08-07
How to Cite
Rajasekaran, G., Snowvin, L. D., Vaishnavi, K., Mary, D. A., Rajest, S. S., & Ali, M. M. S. (2025). AI Voice Assistant and Caption Generation Using Convolution Neural Network and Bi LSTM. CENTRAL ASIAN JOURNAL OF MATHEMATICAL THEORY AND COMPUTER SCIENCES, 6(4), 758-772. Retrieved from https://cajmtcs.centralasianstudies.org/index.php/CAJMTCS/article/view/807
Section
Articles