AI's Silent Influence: Balancing the Wonders and Limitations in Our Digital Realm

Convolutional neural networks (CNNs) can recognise objects in images – provided their environment does not change. Mar Castell Erill explains the functionality and limitations of CNNs and makes us wonder whether AI applications are not sometimes overestimated.

KI und Nachhaltigkeit

View comments

In our modern world, artificial intelligence (AI) quietly plays a crucial role, managing various aspects of our digital lives. It's capable of tasks like diagnosing medical conditions, guiding autonomous vehicles, and even composing symphonies. In many ways, AI has surpassed human abilities. It's integrated into our daily lives, from helping with navigation on our phones to translating languages and responding to voice commands in our homes and places of work.

Despite AI's remarkable progress, neural networks–a fundamental component of modern AI–still have limitations. They lack common-sense reasoning, empathy and the nuanced understanding of human emotions.1 They're not capable of philosophical pondering or engaging in the creative expression that defines human art. CAPTCHAs are a good example of AI’s limitations.2

Hidden within the complex codes and algorithms of the internet lies a battleground where humans and AI engage in a covert competition. Websites, applications and online platforms utilize CAPTCHAs to distinguish human users from automated ones.3 These challenges, though seemingly simple, are based on intricate science that exposes the vulnerabilities of AI. Consider encountering a CAPTCHA that displays distorted characters, digits or symbols, some of which may not be neatly aligned, appearing askew or even rotated. Human cognition easily adapts to this challenge by mentally adjusting character orientations to decode them. However, for AI, this still presents a significant challenge.4

This is particularly true of convolutional neural networks (CNN), a type of deep learning model that aims to replicate the human visual system's image processing capabilities. CNNs use convolutional layers to extract hierarchical features from images, which makes them highly efficient at recognising patterns, shapes and objects. This fundamental architecture has positioned CNNs as a leading choice for image recognition tasks.5

One of the fundamental challenges associated with CNNs is their sensitivity to changes in an object's surroundings.6 Unlike humans, who can easily identify an object regardless of factors like orientation, background, texture or lighting, CNNs tend to produce incorrect results when presented with inputs that even slightly deviate from their established patterns. CNNs rely on fixed filters designed to detect specific features in an image. When an image is altered, perhaps due to factors such as rotation or blurriness, these features may not be represented in the same way. Consequently, the filters may struggle to recognise them, resulting in reduced accuracy in object recognition.7

The limited robustness of CNNs has significant implications for real-world applications.8 For instance, autonomous vehicles depend on image recognition to navigate. If a system based on CNNs encounters rotated road signs or obstacles, it might not perform well, which could raise safety issues. Likewise, in the field of medical imaging, the difficulty of dealing with rotated scans could impact the accuracy of disease diagnosis. These limitations, to some extent, hinder the widespread adoption of AI in our daily life.

Researchers are actively exploring strategies to mitigate CNNs' brittleness to common perturbations that humans easily navigate. The most obvious approach involves data augmentation, where training data is enriched with rotated versions of images.9 This exposes the network to a broader range of orientations during training, improving its ability to handle novel images to some extent. Other scholars have opted for customising the network’s architecture such as manipulating the pooling layers, the loss function or even introducing traditional machine learning algorithms for classification.10

Despite the challenges presented by neural networks' limited adaptability to physical perturbations, this limitation can be strategically harnessed for our benefit. This interactive relationship between AI and humans illustrates a mutually beneficial connection, where one's weaknesses can complement the strengths of the other. Beyond their primary security function, CAPTCHAs also serve as a benchmark task for evaluating AI technologies, as noted by von Ahn et al., who observed that ‘any program passing the tests generated by a CAPTCHA can be used to tackle challenging unsolved AI problems’.11

In the intricate dance between humans and AI, we've uncovered AI's remarkable capabilities, as well as its flaws. Interestingly, as we depend more on AI, we've learned to leverage its limitations to the point where distinguishing between human and machine intelligence has become a real challenge. As we navigate the hidden AI that shapes our digital world, we find ourselves in a paradoxical embrace in which technology, often written about, is also the author itself.

Footnotes

J. Alrassi, P. J. Katsufrakis, L. Chandran, L.: Technology Can Augment, but Not Replace, Critical Human Skills Needed for Patient Care. In: Academic Medicine. Volume 96, No. 1, 2020, 37–43. https://doi.org/10.1097/acm.0000000000003733; A. Kerasidou: , A.: Artificial intelligence and the ongoing need for empathy, compassion and trust in healthcare. In: Bulletin of the World Health Organization. Volume 98, No. 4, 2020, 245–250. https://doi.org/10.2471/blt.19.237198

R. Gossweiler, M. Kamvar, S. Baluja, S.: What’s up CAPTCHA? In: 2009: Proceedings of the 18th International Conference on World Wide Web. 2009. https://doi.org/10.1145/1526709.1526822; J. Kim, W. Chung, H. Cho: A new image-based CAPTCHA using the orientation of the polygonally cropped sub-images. In: The Visual Computer. Volume 26, No. 6–8, 2010, 1135–1143. https://doi.org/10.1007/s00371-010-0469-3; M. Guerar, L. Verderame, M. Migliardi, F. Palmieri, A. Merlo: Gotta CAPTCHA ’Em All: A Survey of Twenty years of the Human-or-Computer Dilemma. In: ACM Computing Surveys. Volume 54, 2021. https://doi.org/10.1145/3477142

M. Conti, L. Pajola, P.P. Tricomi: Captcha Attack: Turning Captchas against humanity. In: arXiv. 2022. https://doi.org/10.48550/arxiv.2201.04014

E. Bursztein, M. Martin, J. C. Mitchell: Text-based CAPTCHA strengths and weaknesses. In: CCS ’11: Proceedings of the 18th ACM Conference on Computer and Communications Security. 2011. https://doi.org/10.1145/2046707.2046724

Y. LeCun, K. Kavukcuoglu, C. Farabet: Convolutional networks and applications in vision. In: IEEE. 2010. https://doi.org/10.1109/iscas.2010.5537907; Y. LeCun, Y. Bengio, G. E. Hinton: Deep Learning. In: Nature. Volume 521, No. 7553, 2015, 436–444. https://doi.org/10.1038/nature14539; C. Szegedy, W. Liu, Y. Jia, P. Sermanet, S. Reed, D. Anguelov, D. Erhan, V. Vanhoucke, A. Rabinovich, A.: Going deeper with convolutions. arXiv. 2015. https://doi.org/10.1109/cvpr.2015.7298594

I. J. Goodfellow, J. Shlens, C. Szegedy: Explaining and harnessing adversarial examples. In: International Conference on Learning Representations. 2015. https://ai.google/research/pubs/pub43405; A. Nguyen, J. Yosinski, J. Clune: Deep Neural Networks are Easily Fooled: High Confidence Predictions for Unrecognizable Images. In: arXiv. 2014. http://export.arxiv.org/pdf/1412.1897; A. Azulay, Y. Weiss, Y.: Why do deep convolutional networks generalize so poorly to small image transformations? In: arXiv. 2018. http://export.arxiv.org/pdf/1805.12177

T. Cohen, M. Welling: Group equivariant convolutional networks. In: arXiv. 2016 48, 2990–2999.

M. Ozdag, S. Raj, S. L. Fernandes, A. Velasquez, L. L. Pullum, S. K. Jha: On the Susceptibility of Deep Neural Networks to Natural Perturbations. In: International Joint Conference on Artificial Intelligence. 2019; R. C. Maron, J. G. Schlager, S.Haggenmüller, C. von Kalle, J. Utikal, F. Meier, F. F. Gellrich, S. Hobelsberger, A. Hauschild, L. E. French, L. Heinzerling, M. Schlaak, K. Ghoreschi, F. J. Hilke, G. Poch, M. V. Heppt, C. Berking, S. Haferkamp, W. Sondermann, T. J. Brinker: A benchmark for neural network robustness in skin cancer classification. In: European Journal of Cancer. Volume 155, 2021, 191–199. https://doi.org/10.1016/j.ejca.2021.06.047

S. Dieleman, K. Willett, J. Dambre: Rotation-invariant convolutional neural networks for galaxy morphology prediction. In: Monthly Notices of the Royal Astronomical Society. Volume 450, No. 2, 2015, 1441–1459. https://doi.org/10.1093/mnras/stv632; I. Kandel, M. Castelli, L. Manzoni, L.: Brightness as an augmentation technique for image classification. In: Emerging Science Journal. Volume 6, No. 4, 2022, 881–892. https://doi.org/10.28991/esj-2022-06-04-015

10.

T. Cohen, M. Welling: Group equivariant convolutional networks. In: arXiv. 2016, 2990–2999; D. Marcos, M. Volpi, N. Komodakis, D. Tuia: Rotation Equivariant Vector Field Networks. In: arXiv. 2017. https://doi.org/10.1109/iccv.2017.540

11.

Von L. Ahn, M. Blum, N. Hopper, J. Langford: CAPTCHA: Using hard AI problems for security. In: Lecture Notes in Computer Science. 2003, 294–311. https://doi.org/10.1007/3-540-39200-9_18

KI und Nachhaltigkeit

AI's Silent Influence: Balancing the Wonders and Limitations in Our Digital Realm

AI's Silent Influence: Balancing the Wonders and Limitations in Our Digital Realm

Wissen sie, was sie nicht wissen? Über die (Un-)Zuverlässigkeit …

Die KI-Forschung wurde schon einmal überschätzt

New comment