Artificial Intelligence is evolving beyond narrow, task-specific applications into agentic AI—systems capable of making autonomous decisions, adapting to dynamic environments and taking independent actions to achieve goals. This paradigm shift presents unprecedented opportunities for automation, efficiency and innovation. However, as organisations move toward deploying AI agents in critical operations, technology leaders must address several fundamental concerns.
For CTOs and tech executives in general, the question is no longer whether to implement agentic AI but how to do so responsibly and securely. The risks of unchecked autonomy, biased decision-making and unpredictable behaviour demand a structured approach to AI governance, validation and human oversight.
This article explores the core challenges of agentic AI, backed by real-world case studies, and outlines the best mitigation strategies to ensure safe, accountable and effective AI deployment.
In 2023, Samsung engineers inadvertently leaked confidential company code by using ChatGPT to optimise their programming scripts. The AI model retained sensitive trade secrets, which could have been accessed by OpenAI or other users, highlighting the risks of AI-enabled data leaks.
When users share data with AI chatbots, it is stored on the servers of companies like OpenAI, Microsoft and Google—often without a straightforward way to access or delete it. This raises concerns about sensitive information being shared with chatbots like ChatGPT that could unintentionally become accessible to other users.
By default, ChatGPT saves chat history and uses conversations to improve its models. Users can manually disable this feature, but it is unclear whether the setting applies retroactively to past conversations, or whether it works at all, because it is virtually impossible to audit the data that OpenAI and other providers use to train their models.
Technology leaders face a dilemma here: we either act in good faith and use these products, or ban the use of Gen AI tools altogether, as Samsung did. If we do use them, we must accept three possibilities: that our data will be stored on third-party servers, that it may be used to train future models, and that it could unintentionally become accessible to others.
That is, unfortunately, the reality, because we have limited control over data protection when using a third-party SaaS. But what can we do to prevent agentic AI systems from acting erratically?
Agentic AI systems, and AI in general, can act unpredictably, often by pursuing objectives misaligned with our intentions. The concern is amplified in high-stakes scenarios because we entrust complex, “black box” code to make decisions on our behalf.
Malfunctions can have a wide range of consequences. For example:
On March 18, 2018, an Uber self-driving test vehicle in Tempe, Arizona, struck and killed a pedestrian, Elaine Herzberg. This was the first recorded fatality involving a fully autonomous vehicle, raising serious concerns about loss of control in AI-driven systems. The vehicle’s onboard AI was designed to detect and react to obstacles autonomously, but a failure in decision-making and override mechanisms led to a tragic accident.
The AI incorrectly classified the pedestrian as an unknown object rather than a human, delaying its response. To make matters worse, Uber had disabled the vehicle’s built-in emergency braking system, relying entirely on AI-driven decision-making. The system had also been tuned to reduce false positives, meaning it hesitated before deciding to stop, which turned out to be a fatal miscalculation.
A human safety driver was present but not paying attention at the critical moment, as the AI was expected to handle the situation. The software did eventually order the car to brake 1.3 seconds before the collision, but by then it was too late.
This incident shows that blind reliance on agentic AI, systems that are ultimately programmed by humans, can have devastating outcomes.
One mitigation is to keep humans actively involved in shaping the system’s behaviour. A good example is OpenAI’s approach with reinforcement learning from human feedback (RLHF), which uses active human guidance to ensure that the model’s autonomous decisions align with human intentions.
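To make the idea concrete, here is a minimal sketch of the pairwise preference loss commonly used when training RLHF reward models. The reward scores below are hypothetical and the snippet is purely illustrative; it is not OpenAI’s implementation.

```python
import numpy as np

def reward_model_loss(r_chosen: np.ndarray, r_rejected: np.ndarray) -> float:
    """Bradley-Terry style loss: push the reward of the human-preferred
    response above the reward of the rejected one."""
    margin = r_chosen - r_rejected
    # -log(sigmoid(margin)) averaged over the batch, computed stably
    return float(np.mean(np.logaddexp(0.0, -margin)))

# Hypothetical reward-model scores for pairs of responses to the same prompt
chosen = np.array([2.1, 0.7, 1.5])     # responses human raters preferred
rejected = np.array([0.3, 0.9, -0.2])  # responses human raters rejected

print(f"preference loss: {reward_model_loss(chosen, rejected):.3f}")
```

Minimising this loss teaches the reward model to rank outputs the way human raters do; that reward model then guides the policy during fine-tuning.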
In autonomous vehicle development, for example, companies like Tesla include manual steering wheel overrides, allowing drivers to take control when necessary.
IBM’s Watson Health, for example, uses explainable AI to assist doctors in diagnosing diseases by showing the reasoning behind its recommendations. The approach builds trust in its outputs because users have more control over the AI.
A good example of extensive simulation testing is DeepMind’s AlphaGo, which played millions of simulated games. That training allowed researchers to fine-tune its behaviour and prevent erratic strategies.
Difficult as it can sometimes be, following industry standards and regulatory frameworks ensures the safe development and deployment of agentic AI. Both developers and end users should work continuously with policymakers and standards organisations to enforce safety protocols and regular audits.
And the prerequisite for that is monitoring and updating; in other words, deploying systems with continuous monitoring capabilities to detect and address deviations from expected behaviour. For example, AWS and Azure allow developers to update and retrain deployed models to maintain performance and control.
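As a minimal illustration (not tied to any specific AWS or Azure API), a deployed model can be wrapped with a simple accuracy monitor that flags when live performance drifts below a threshold and a retraining job should be triggered. The window size, threshold and retraining hook are all placeholders.

```python
from collections import deque

class AccuracyMonitor:
    """Tracks rolling accuracy of a deployed model and flags drift."""

    def __init__(self, window: int = 500, threshold: float = 0.90):
        self.outcomes = deque(maxlen=window)  # 1 = correct, 0 = incorrect
        self.threshold = threshold

    def record(self, prediction, ground_truth) -> None:
        self.outcomes.append(1 if prediction == ground_truth else 0)

    def needs_retraining(self) -> bool:
        if len(self.outcomes) < self.outcomes.maxlen:
            return False  # not enough evidence yet
        return sum(self.outcomes) / len(self.outcomes) < self.threshold

# Usage: after each labelled production sample arrives
monitor = AccuracyMonitor(window=500, threshold=0.90)
# monitor.record(model_prediction, human_label)
# if monitor.needs_retraining():
#     trigger_retraining_pipeline()  # hypothetical hook into your MLOps stack
```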
Agentic AI systems face ethical dilemmas, such as deciding whose safety to prioritise or whether to follow instructions that conflict with moral principles. Decisions may not align with societal values, leading to public backlash or regulatory scrutiny.
Facebook experienced this kind of backlash in 2016, when its News Feed algorithm inadvertently promoted fake news and divisive content, raising concerns about the ethical implications of its design. It was a blatant example of missing oversight of the algorithm’s impact on public discourse and an absence of ethical considerations: the algorithm simply prioritised engagement over truth.
To mitigate this, Facebook implemented fact-checking partnerships with third-party organisations to address misinformation and started conducting regular ethical reviews to identify and mitigate unintended harms. Additional tools were developed to prioritise high-quality information and limit the spread of harmful content.
Google’s AI Principles explicitly prohibit building AI systems that cause harm or reinforce bias, ensuring ethical guardrails. They collaborated with ethicists, domain experts and diverse stakeholders to define moral principles and embed them into the AI’s decision-making algorithms.
As noted earlier, OpenAI employs RLHF for ChatGPT, training the model to align its responses with user-defined ethical standards. It is a proven approach to ensuring AI systems reflect human values, and it relies on regular feedback from diverse groups of users, because it is imperative that an AI system reflects a broad range of perspectives.
Microsoft’s AI, Ethics, and Effects in Engineering and Research (Aether) committee regularly reviews the company’s AI projects for ethical risks, conducting regular ethical audits and AI impact assessments (AIIAs) to evaluate the social, environmental and moral implications of AI deployments. Any organisation can adopt this practice by establishing independent review boards to assess ethical risks and provide actionable recommendations.
IBM’s Watson Health, mentioned earlier, faced criticism for recommending different cancer treatments based on biased training data. The company addressed this by revising its datasets and involving clinicians in the training process; in other words, bias was reduced by curating more representative data and keeping domain experts in the loop.
Similar to IBM’s example, DARPA’s Explainable AI (XAI) program focuses on developing systems that justify their decisions, enabling users to identify ethical concerns. These systems utilise tools like LIME (Local Interpretable Model-agnostic Explanations) to make AI decisions interpretable and assess their ethical soundness.
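For illustration, this is roughly how the open-source lime package can be used to explain a single prediction of a tabular classifier. The model, data and feature names here are placeholders standing in for a real clinical or risk dataset.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from lime.lime_tabular import LimeTabularExplainer

# Toy training data standing in for a real dataset
rng = np.random.default_rng(0)
X_train = rng.normal(size=(200, 4))
y_train = (X_train[:, 0] + X_train[:, 2] > 0).astype(int)

model = RandomForestClassifier(random_state=0).fit(X_train, y_train)

explainer = LimeTabularExplainer(
    X_train,
    feature_names=["age", "blood_pressure", "glucose", "bmi"],
    class_names=["low risk", "high risk"],
    mode="classification",
)

# Explain one prediction: which features pushed the score up or down?
explanation = explainer.explain_instance(X_train[0], model.predict_proba, num_features=4)
for feature, weight in explanation.as_list():
    print(f"{feature}: {weight:+.3f}")
```

The weighted feature list is what a reviewer would inspect to judge whether the model’s reasoning is ethically and clinically sound.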
Autonomous vehicle companies like Waymo conduct ethical scenario testing to evaluate how their systems handle life-critical situations, such as whom to prioritise in a potential collision. This testing takes place in simulated environments that mimic real-world ethical conflicts, allowing engineers to analyse the system’s decision-making process before deployment.
Agentic AI systems can be manipulated, hacked or even weaponised, with autonomous decision-making amplifying their destructive potential. We all saw that ChatGPT-powered gun on YouTube, didn’t we?
In 2020, the SolarWinds cyberattack demonstrated the risks of a compromised software supply chain, risks that apply equally to AI systems. Malicious actors injected malware into the Orion software platform, impacting thousands of clients, including government agencies.
This case demonstrated a serious lack of robust monitoring in the software update process and insufficient measures to detect and prevent supply chain attacks. To mitigate this and re-establish trust, the company had to implement code-signing practices and enhanced monitoring tools while partnering with security agencies and commissioning third-party audits.
We must identify potential threats specific to the AI system and its deployment environment, including adversarial attacks and data poisoning. To achieve that, we can use comprehensive threat modelling techniques, such as STRIDE (Spoofing, Tampering, Repudiation, Information disclosure, Denial of service, Elevation of privilege), to evaluate risks and develop countermeasures.
Google DeepMind, for instance, employs advanced threat modelling for AI systems to assess and mitigate vulnerabilities.
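As a lightweight starting point, the STRIDE categories can be captured as a simple checklist applied to each component of an AI pipeline. The questions and the example review below are illustrative, not an exhaustive or standard threat model.

```python
# Illustrative STRIDE checklist for an AI pipeline component
STRIDE = {
    "Spoofing": "Can an attacker impersonate a legitimate data source or user?",
    "Tampering": "Can training data or model weights be modified (data poisoning)?",
    "Repudiation": "Are model decisions logged so actions can be attributed?",
    "Information disclosure": "Can prompts or training data leak sensitive information?",
    "Denial of service": "Can crafted inputs or request floods make the model unavailable?",
    "Elevation of privilege": "Can the agent be tricked into actions beyond its permissions?",
}

def review_component(name: str, answers: dict[str, str]) -> None:
    """Print each STRIDE question and flag categories with no documented mitigation."""
    print(f"Threat review: {name}")
    for category, question in STRIDE.items():
        mitigation = answers.get(category, "NO MITIGATION DOCUMENTED")
        print(f"  {category}: {question}\n    -> {mitigation}")

# Hypothetical review of a model-serving API
review_component("inference API", {
    "Spoofing": "API keys plus mutual TLS",
    "Denial of service": "Rate limiting per client",
})
```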
OpenAI adopted secure development practices to minimise risks in GPT-based models, including API rate-limiting to prevent misuse. They employ techniques such as differential privacy and secure multiparty computation to protect sensitive data used in AI training and deployment.
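Rate limiting an AI endpoint can be as simple as a token bucket per API key. The sketch below is a generic pattern, not OpenAI’s implementation, and the capacities shown are arbitrary examples.

```python
import time

class TokenBucket:
    """Allows up to `capacity` requests in a burst, refilled at `rate` tokens per second."""

    def __init__(self, capacity: int, rate: float):
        self.capacity = capacity
        self.rate = rate
        self.tokens = float(capacity)
        self.last = time.monotonic()

    def allow(self) -> bool:
        now = time.monotonic()
        self.tokens = min(self.capacity, self.tokens + (now - self.last) * self.rate)
        self.last = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0
            return True
        return False  # caller should return HTTP 429 to the client

# One bucket per API key: bursts of 60 requests, refilled at 1 request/second
buckets = {"api-key-123": TokenBucket(capacity=60, rate=1.0)}
if not buckets["api-key-123"].allow():
    print("Too many requests, slow down")
```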
Tesla tests its autonomous vehicle systems against adversarial inputs, such as altered road signs, to ensure the AI behaves correctly in manipulated environments. Adversarial examples are used to evaluate how the system reacts to maliciously crafted inputs. These simulations of real-world attacks have two goals: to expose weaknesses before attackers find them and to verify that the system degrades safely when its inputs are manipulated.
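Real adversarial testing of perception models typically uses gradient-based attacks and is far more sophisticated, but the shape of the check can be sketched as a robustness test: perturb inputs slightly and measure how often the model’s prediction flips. The function below assumes a classifier exposing a predict() call over features scaled to [0, 1].

```python
import numpy as np

def robustness_rate(model_predict, X: np.ndarray, epsilon: float = 0.05, trials: int = 20) -> float:
    """Fraction of samples whose prediction stays stable under small random perturbations."""
    baseline = model_predict(X)
    stable = np.ones(len(X), dtype=bool)
    rng = np.random.default_rng(42)
    for _ in range(trials):
        noise = rng.uniform(-epsilon, epsilon, size=X.shape)
        perturbed_pred = model_predict(np.clip(X + noise, 0.0, 1.0))
        stable &= (perturbed_pred == baseline)
    return float(stable.mean())

# Usage with any classifier whose features are scaled to [0, 1]:
# rate = robustness_rate(model.predict, X_test)
# assert rate > 0.95, "model is too sensitive to small input perturbations"
```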
By default, AI systems should integrate robust monitoring and alert mechanisms, enabling swift responses to potential security threats. These mechanisms detect anomalies and security breaches and route alerts to dedicated incident response teams, which follow established protocols to address incidents as they occur.
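A minimal version of such monitoring flags statistical anomalies in operational metrics (request volume, error rates, unusual output patterns) and routes them to responders. The alert hook below is a placeholder for whatever paging system an organisation actually uses.

```python
import statistics

def detect_anomaly(metric_history: list[float], latest: float, z_threshold: float = 3.0) -> bool:
    """Flag the latest metric value if it deviates strongly from recent history."""
    if len(metric_history) < 30:
        return False  # not enough history to judge
    mean = statistics.fmean(metric_history)
    stdev = statistics.stdev(metric_history) or 1e-9
    return abs(latest - mean) / stdev > z_threshold

def alert_incident_response(message: str) -> None:
    # Placeholder: in practice this would page the on-call team (PagerDuty, Opsgenie, ...)
    print(f"[SECURITY ALERT] {message}")

requests_per_minute = [120.0] * 60  # steady baseline traffic
latest = 2400.0                     # sudden spike, e.g. automated abuse
if detect_anomaly(requests_per_minute, latest):
    alert_incident_response(f"Request volume spiked to {latest}/min")
```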
Back to basic cybersecurity – limit access to AI systems and their underlying infrastructure using strong authentication methods and role-based access controls. Zero-trust policies are still the best first line of defence.
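In code, role-based access control around an AI system’s sensitive operations can be as simple as a permission check before each action. The roles, permissions and function names below are examples only; production systems would load them from an identity provider.

```python
from functools import wraps

# Example role-to-permission mapping; real deployments would load this from an IdP
ROLE_PERMISSIONS = {
    "ml_engineer": {"deploy_model", "view_logs"},
    "analyst": {"view_logs"},
    "admin": {"deploy_model", "view_logs", "delete_model"},
}

def requires_permission(permission: str):
    def decorator(func):
        @wraps(func)
        def wrapper(user_role: str, *args, **kwargs):
            if permission not in ROLE_PERMISSIONS.get(user_role, set()):
                raise PermissionError(f"role '{user_role}' lacks '{permission}'")
            return func(user_role, *args, **kwargs)
        return wrapper
    return decorator

@requires_permission("deploy_model")
def deploy_model(user_role: str, model_id: str) -> None:
    print(f"deploying {model_id}")

deploy_model("ml_engineer", "fraud-detector-v7")   # allowed
# deploy_model("analyst", "fraud-detector-v7")     # raises PermissionError
```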
The additional mitigation strategies are:
It’s often difficult to understand or explain the decisions made by complex AI systems, creating a “black box” problem. This causes challenges in assigning responsibility for errors or harm and complicates regulatory compliance and legal proceedings.
The COMPAS (Correctional Offender Management Profiling for Alternative Sanctions) AI system was used in US courts to predict the likelihood of criminal reoffending. However, an investigative report found that COMPAS was biased against African Americans and lacked transparency in its decision-making. The report identified three major problems: racially biased risk scores, poor predictive accuracy and a proprietary model that could not be independently inspected.
Based on this case, AI models used in legal decision-making now require explainability, auditing, human oversight, regulatory compliance and stakeholder engagement. By implementing these practices, AI systems become more accountable and transparent.
Tesla’s Autopilot system, an advanced driver-assistance AI, has been involved in multiple fatal accidents where drivers over-relied on the AI and disengaged from their driving responsibilities. Despite the manufacturer’s warnings, drivers believed the system was fully autonomous and even ignored alerts prompting them to keep their hands on the wheel.
The problem was that Autopilot did not always escalate warnings forcefully when drivers became unresponsive.
To solve this issue, Tesla now requires drivers to periodically touch the steering wheel to ensure engagement. The system was also updated to activate more aggressive visual and auditory warnings if the driver fails to take control.
But there is another underlying problem. Over-reliance on agentic AI can erode critical human skills through blind trust in automated systems, and when the AI malfunctions, the resulting system-wide failures can even turn deadly.
AI should assist rather than replace human decision-makers, especially in high-risk sectors, and human operators must maintain their expertise rather than become entirely dependent on AI. For example, after the Air France Flight 447 crash in 2009, where pilots failed to react properly when the autopilot disengaged, airlines introduced mandatory manual flying hours to prevent skill degradation. The same thing could happen to software development and software evolution if we fail to address this problem in time.
To sum up, to prevent dependence and over-reliance on agentic AI, organisations should keep humans in the loop for critical decisions, enforce engagement checks and escalating alerts, and maintain human expertise through regular manual practice and training.
Agentic AI systems may fail to make consistent, accurate decisions in dynamic, uncertain or adversarial environments. Consequently, they may cause catastrophic errors in critical domains.
Regardless, AI-powered chatbots are increasingly used for medical symptom analysis, for example. Yet AI lacks real-world clinical experience, hallucinates, can fail to identify rare conditions and has no self-checking mechanism: most LLMs we use daily do not verify their own answers before returning results.
Let’s use case studies and real-world examples to see how to improve accuracy so we can rely more on Agentic AI.
Google’s Med-PaLM 2, for instance, initially struggled with accuracy due to biased training data. The company was forced to improve reliability by training on diverse multi-institutional datasets.
Uber’s self-driving car fatally struck a pedestrian in 2018 due to poor real-world validation. Waymo, by contrast, conducted millions of real-world and simulated test miles, reducing failure rates before public deployment. Waymo proved that AI models must undergo rigorous validation and real-world scenario testing before deployment.
IBM Watson for Oncology initially provided incorrect treatment recommendations due to limited training data. The company introduced real-time physician feedback loops, allowing the model to improve through expert corrections. Thanks to these feedback loops and improved confidence scoring, the system can now detect errors and self-correct in near real time.
Another way to improve the decision accuracy of agentic AI is to use multiple models. In ensemble learning, several models provide independent predictions and vote on the final decision, with backup rule-based systems reserved for high-risk calls. The best example is NASA’s Mars Rover AI navigation, which uses redundant models to cross-validate terrain analysis before making navigation decisions, preventing mission-critical failures caused by single-model inaccuracies.
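A minimal sketch of this voting-plus-fallback pattern, assuming three already-trained classifiers and a hand-written conservative rule for high-risk cases; the lambdas simply stand in for real models.

```python
from collections import Counter

def ensemble_decision(models, x, high_risk: bool, safety_rule) -> str:
    """Majority vote across models; defer to a rule-based fallback for high-risk inputs
    when the models disagree."""
    votes = [m(x) for m in models]
    decision, count = Counter(votes).most_common(1)[0]
    unanimous = count == len(votes)
    if high_risk and not unanimous:
        return safety_rule(x)  # conservative rule-based backup
    return decision

# Hypothetical stand-ins for trained models and a safety rule
model_a = lambda x: "proceed"
model_b = lambda x: "proceed"
model_c = lambda x: "stop"
conservative_rule = lambda x: "stop"  # when in doubt on a high-risk input, stop

print(ensemble_decision([model_a, model_b, model_c], x={"speed": 42},
                        high_risk=True, safety_rule=conservative_rule))  # -> "stop"
```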
Arguably the best approach to developing reliable and accurate agentic AI is to force the system to explain its decisions and to flag uncertain predictions for human review. This can be done by incorporating XAI techniques and implementing confidence thresholds that trigger human intervention for low-confidence results. For example, DeepMind’s kidney disease prediction model in healthcare flagged high-risk cases with explainability reports, allowing doctors to verify predictions before acting.
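The escalation logic itself is straightforward. The sketch below assumes the model returns a confidence score and a short explanation alongside its prediction, and that low-confidence cases are queued for a human reviewer; the threshold and field names are illustrative.

```python
from dataclasses import dataclass

@dataclass
class Prediction:
    label: str
    confidence: float   # 0.0 - 1.0
    explanation: str    # e.g. top features from an XAI tool such as LIME or SHAP

def route_prediction(pred: Prediction, threshold: float = 0.85) -> str:
    """Auto-accept confident predictions; escalate uncertain ones to a human."""
    if pred.confidence >= threshold:
        return f"AUTO: {pred.label}"
    # Below threshold: attach the explanation so the reviewer sees the model's reasoning
    return f"HUMAN REVIEW NEEDED: {pred.label} ({pred.confidence:.0%}) - {pred.explanation}"

print(route_prediction(Prediction("low risk", 0.93, "normal creatinine trend")))
print(route_prediction(Prediction("high risk", 0.61, "rising creatinine, low urine output")))
```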
The bottom line is that AI should never operate fully autonomously in critical situations. In other words, deploy AI as decision support rather than an autonomous agent, and mandate manual approval for AI-generated recommendations in high-risk industries. It brings us back to the Boeing 737 MAX MCAS incidents, where a faulty automated flight-control system repeatedly overrode pilot inputs, leading to fatal crashes.
To improve reliability and accuracy, organisations should train on diverse, representative datasets, validate models against rigorous real-world and simulated scenarios, build in expert feedback loops and confidence scoring, add ensemble and rule-based redundancy for high-risk decisions, and keep humans in the approval path for critical recommendations.
Agentic AI presents immense opportunities but also introduces critical risks: data leakage, loss of control and unpredictable behaviour, ethical misalignment, security vulnerabilities, lack of accountability and transparency, over-reliance and skill erosion, and unreliable decision-making.
To mitigate these, technology leaders must prioritise human oversight, robust security measures and explainability while enforcing strict governance frameworks.
AI should be an assistive tool, not an autonomous decision-maker in high-risk domains. In other words, human expertise remains central.
Success in deploying agentic AI hinges on continuous validation, adversarial testing, regulatory alignment and adaptive learning models. Organisations that proactively address these challenges will drive trustworthy, resilient and high-impact AI adoption, positioning themselves as industry leaders in safe and scalable AI innovation.