OPTIMIZATION OF NEURAL NETWORKS IN DIFFUSION MODELS FOR SKETCH GENERATION USING BEZIER CURVE

Authors

DOI:

https://doi.org/10.35546/kntu2078-4481.2026.1.29

Keywords:

optimization, neural networks, graphic content, scalability, Bezier curves

Abstract

This article presents a comprehensive study regarding an innovative approach to optimising the computational process of generating artistic sketches, specifically focusing on the significant task of automating digital drawing workflows. Following a detailed and rigorous analysis of existing methodologies, a strategic decision was made to utilize parametric curve-based abstract data structures – specifically Bézier curves – as a superior alternative to the traditional, widely utilized raster pixel representations. This fundamental transition to using mathematical curve descriptions provides significantly broader opportunities for constructing compact, efficient models that yield higher quality and fully scalable generation results. Through this research, a robust model capable of generating novel geometric data from an input sketch was obtained, and the stability, consistency, and variability of the generation process were rigorously analyzed to ensure reliability. A critical component of this study involves a comparative analysis of the neural network parameters required for generating graphics in standard pixel formats versus the proposed Bézier curve method. The empirical results demonstrated that the employment of Bézier curves significantly reduces both the number of necessary input parameters and the overall computational complexity of the neural network architecture. To illustrate this computational disparity, creating a model to generate a standard 32 × 32 pixel image requires 1,024 input parameters (if the pixels are represented in black and white format) or 3,072 parameters (if in RGB format). In stark contrast, the model utilized to generate scalable curves requires only 800 parameters, representing a substantial reduction in computational load while maintaining structural integrity. The pivotal advantage of this new approach lies in its provision of perfect scalability; unlike traditional pixel (raster) formats, the generated data can be effortlessly enlarged or reduced to any dimension with zero loss of visual fidelity. Consequently, the quality of the generated image is no longer bound by the fixed resolution limitations inherent in raster formats but is determined solely by the potential and sophistication of the neural network architecture.

References

Hutson, J., & Robertson, B. (2023). A Matter of Perspective: A Case Study in the Use of AI-Generative Art in the Drawing Classroom. International Journal of New Media, Technology & the Arts, 18.

Porvari, E. (2024). Exploring the potential of AI utilization in the interpretation and digitization of legacy drawings.

Yang, L., Zhang, Z., Song, Y., Hong, S., Xu, R., Zhao, Y.,... & Yang, M. H. (2023). Diffusion models: A comprehensive survey of methods and applications. ACM computing surveys, 56(4), 1–39.

Cottier, B., Rahman, R., Fattorini, L., Maslej, N., Besiroglu, T., & Owen, D. (2024). The rising costs of training frontier AI models. arXiv preprint arXiv:2405.21015.

Mudrinić, D., & Šoda, I. (2024, November). The Rise of Generative Artificial Intelligence in Business. In 2024 IEEE 17th International Scientific Conference on Informatics (Informatics) (pp. 493–498). IEEE.

Kusetogullari, A., Kusetogullari, H., Andersson, M., & Gorschek, T. (2025). GenAI in entrepreneurship: A systematic review of generative artificial intelligence in entrepreneurship research: Current issues and future directions. arXiv preprint arXiv:2505.05523.

Zhang, M., & Li, J. (2021). A commentary of GPT-3 in MIT Technology Review 2021. Fundamental Research, 1(6), 831-833.

Puteikis, K., & Mameniškienė, R. (2024). Artificial intelligence: Can it help us better grasp the idea of epilepsy? An exploratory dialogue with ChatGPT and DALL· E 2. Epilepsy & Behavior, 156, 109822.

Liu, Y., Zhang, K., Li, Y., Yan, Z., Gao, C., Chen, R.,... & Sun, L. (2024). Sora: A review on background, technology, limitations, and opportunities of large vision models. arXiv preprint arXiv:2402.17177.

Samson, G. (2025, June). Perspectives on Generative Sound Design: A Generative Soundscapes Showcase. In Arts (Vol. 14, No. 3, p. 67). MDPI.

Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B., Warde-Farley, D., Ozair, S.,... & Bengio, Y. (2014). Generative adversarial nets. Advances in neural information processing systems, 27.

Sohl-Dickstein, J., Weiss, E., Maheswaranathan, N., & Ganguli, S. (2015, June). Deep unsupervised learning using nonequilibrium thermodynamics. In International conference on machine learning (pp. 2256–2265). pmlr.

Ho, J., Jain, A., & Abbeel, P. (2020). Denoising diffusion probabilistic models. Advances in neural information processing systems, 33, 6840-6851.

Nichol, A. Q., & Dhariwal, P. (2021, July). Improved denoising diffusion probabilistic models. In International conference on machine learning (pp. 8162–8171). PMLR.

Xiang, X., Liu, D., Yang, X., Zhu, Y., & Shen, X. (2021). Anime2Sketch: A Sketch Extractor for Anime Arts with Deep Networks [Computer software]. GitHub. https://github.com/Mukosame/Anime2Sketch

Xiang, X., Liu, D., Yang, X., Zhu, Y., Shen, X., & Allebach, J. P. (2022). Adversarial open domain adaptation for sketch-to-photo synthesis. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (pp. 1434–1444).

Веретюк, O., Огерук, Б., & Андрущак, Н. (2025). Використання шумів для стабільного генерування зображень у дифузійних моделях. Комп’ютерні системи проєктування. Теорія і практика. 7(1), 113–123. https://doi.org/10.23939/cds2025.01.113

Published

2026-04-30