ПРИМЕНЕНИЕ БОЛЬШИХ ЯЗЫКОВЫХ МОДЕЛЕЙ ДЛЯ ИНТЕЛЛЕКТУАЛЬНОГО УПРАВЛЕНИЯ КЛАСТЕРАМИ KUBERNETES: ВОЗМОЖНОСТИ И ПЕРСПЕКТИВЫ

УДК 004.891.3:004.032.26
DOI: 10.36871/2618-9976.2025.11.011

Авторы

Сергей Александрович Ярушев,
Кандидат технических наук, директор центра перспективных исследований в искусственном интеллекте, Российский экономический университет им. Г.В. Плеханова, Москва, Россия
Александр Олегович Ануров,
Лаборант-исследователь центра перспективных исследований в искусственном интеллекте, Российский экономический университет им. Г.В. Плеханова, Москва, Россия
Геннадий Геннадиевич Булгаков,
Аспирант Российский экономический университет им. Г.В. Плеханова, Москва, Россия

Аннотация

В данной статье рассматриваются основные подходы к интеграции больших языковых моделей (LLM) в системы управления контейнерной оркестрации Kubernetes. Анализируется потенциал LLM для автоматизации операционных задач: генерации YAML-манифестов, оптимизации resource requests/limits, диагностики инцидентов. Предложена трехуровневая архитектура интеллектуального ассистента, включающая интерфейсный слой для natural language processing, слой обработки с RAGмеханизмами обогащения контекста и исполнительный слой взаимодействия с Kubernetes API. Обсуждаются ограничения, связанные с безопасностью (prompt injection, data leakage) и надежностью (hallucinations, API dependencies). Определены перспективные направления: разработка domain-specific моделей, мультимодальный анализ метрик, создание автономных агентов на базе reinforcement learning, а также проведен анализ потенциальных уровней интеграции LLM в оркестратор.

Ключевые слова

Kubernetes
LLM
DevOps automation
container orchestration
AIOps
cloud native
infrastructure as code
RAG

Список литературы

[1] Anthropic. (2024). Claude’s Constitution: Building Safer AI Systems. https://www.anthropic.com/news/claudesconstitution.

[2] Build a RAG chatbot with GKE and Cloud Storage. Technical overview. https://cloud.google.com/kubernetesengine/docs/tutorials/buildragchatbot.

[3] Burns B., Grant B., Oppenheimer D., Brewer E., & Wilkes J. (2016). Borg, Omega, and Kubernetes: Lessons learned from three containermanagement systems over a decade. ACM Queue, 14(1), 7093. https://dl.acm.org/doi/10.1145/2898442.2898444.

[4] Brown T., Mann B., Ryder N., et al. (2020). Language Models are FewShot Learners. Advances in Neural Information Processing Systems, 33, 1877–1901. https://arxiv.org/abs/2005. 14165.

[5] Chen M., Tworek J., Jun H., et al. (2021). Evaluating Large Language Models Trained on Code. https://arxiv.org/abs/2107.03374.

[6] Cloud Native Computing Foundation. CNCF Annual Survey 2024. https://www.cncf.io/ reports/cncfannualsurvey2024/.

[7] CNCF Technical Advisory Group. (2023). Cloud Native Glossary. https://glossary.cncf.io/.

[8] DataDog. (2024). Container Report 2024: Kubernetes and Cloud Native Adoption Trends. https://www.datadoghq.com/containerreport/.

[9] Feng Z., Guo D., Tang D., et al. (2020). CodeBERT: A PreTrained Model for Programming and Natural Languages. Findings of EMNLP 2020. https://arxiv.org/abs/2002.08155.

[10] Gartner. (2023). Market Guide for AIOps Platforms. https://www.gartner.com/en/ documents/4022167.

[11] Google Cloud. (2024). Site Reliability Engineering: How Google Runs Production Systems. https://sre.google/srebook/tableofcontents/.

[12] Huang Q., et al. (2024). KubectlAI: Natural Language Kubernetes Management. Proceedings of the 2024 USENIX Annual Technical Conference. https://www.usenix.org/conference/atc24.

[13] Kubernetes adoption, security, and market trends report 2024 Technical overview https://www.redhat.com/en/resources/kubernetesadoptionsecuritymarkettrendsoverview.

[14] Kubernetes Special Interest Group. (2024). Kubernetes Enhancement Proposals (KEPs). https://github.com/kubernetes/enhancements.

[15] Linux Foundation. (2024). 2024 Cloud Native Survey: Developer Trends Report. https://www.linuxfoundation.org/research/worldofopensource2024developertrends.

[16] Lewis P., Perez E., Piktus A., et al. (2020). RetrievalAugmented Generation for KnowledgeIntensive NLP Tasks. Advances in Neural Information Processing Systems, 33. https:// arxiv.org/abs/2005.11401.

[17] Microsoft Azure. (2024). AKS (Azure Kubernetes Service) Best Practices. https://learn. microsoft.com/enus/azure/aks/bestpractices.

[18] Obinna Johnphill, Ali Safaa Sadiq, Omprakash Kaiwartya (2024). An Intelligent Approach to Automated Operating Systems Log Analysis for Enhanced Security. https://www.mdpi.com/20782489/15/10/657.

[19] OpenAI. (2023). GPT4 Technical Report. https://arxiv.org/abs/2303.08774.

[20] Red Hat. (2024). State of Kubernetes Security Report 2024. https://www.redhat.com/en/resources/statekubernetessecurityreport.

[21] The Kubernetes Authors. Kubernetes Documentation. https://kubernetes.io/docs/.

[22] Verma, A., Pedrosa, L., Korupolu, M., Oppenheimer, D., Tune, E., & Wilkes, J. (2015). Largescale cluster management at Google with Borg. Proceedings of the European Conference on Computer Systems (EuroSys). https://research.google/pubs/pub43438/.

[23] Xiao Luo, Nur ZincirHeywood, Malcolm Heywood (2023). Benchmarking Large Language Models for Log Analysis, Security, and Interpretation. https://arxiv.org/html/2311.14519.

[24] Zhang, C., et al. (2023). K8sGPT: Giving Kubernetes Superpowers. GitHub Repository. https://github.com/k8sgptai/k8sgpt.

ПРИМЕНЕНИЕ БОЛЬШИХ ЯЗЫКОВЫХ МОДЕЛЕЙ ДЛЯ ИНТЕЛЛЕКТУАЛЬНОГО УПРАВЛЕНИЯ КЛАСТЕРАМИ KUBERNETES: ВОЗМОЖНОСТИ И ПЕРСПЕКТИВЫ

Авторы

Аннотация

Ключевые слова

Список литературы

Контакты

Карта сайта