
(WIP) Training Issues

Understanding AI Training Cycles

AI systems, particularly those based on machine learning (ML) and deep learning (DL), require extensive training cycles over large datasets to learn and make predictions or decisions. These cycles can be broadly categorized into three phases: data collection, model training, and model validation. Each phase presents unique security challenges. Let's group some possible attacks under these three phases.

Possible Data Collection Security Issues:

  • Data Poisoning: Malicious actors may introduce corrupted data into the dataset, aiming to skew the AI model's learning process. Discovery typically involves data validation techniques and anomaly detection to identify outliers that do not fit the expected data distribution.

  • Privacy Leaks: Collecting data from individuals without adequate consent or security measures can lead to privacy breaches. Discovery involves auditing data collection processes and employing privacy-preserving techniques like differential privacy.
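The data-poisoning discovery step mentioned above can be sketched with a minimal anomaly check. This is an illustrative z-score outlier detector over a single numeric feature (the `flag_outliers` name and the conventional threshold of 3.0 are my assumptions, not from the source); real pipelines use richer, multi-dimensional methods.

```python
import statistics

def flag_outliers(values, threshold: float = 3.0):
    """Return indices of values whose z-score exceeds the threshold.

    Grossly out-of-distribution points (e.g. poisoned feature values
    injected into a training set) tend to show up as large |z| values.
    """
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)
    if stdev == 0:
        return []  # all values identical; nothing to flag
    return [i for i, v in enumerate(values)
            if abs(v - mean) / stdev > threshold]

# A poisoned sample stands out against 20 clean samples:
data = [10.0] * 20 + [500.0]
print(flag_outliers(data))  # flags index 20, the injected value
```

Note that z-scores only catch *gross* outliers; a careful attacker who poisons data within the expected distribution will evade this kind of check, which is why data provenance and validation matter as well.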

Risks:

  • Skewed AI decisions, potentially causing financial loss or reputational damage.

  • Legal repercussions from privacy violations.
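The differential privacy technique mentioned under privacy leaks can be illustrated with the classic Laplace mechanism on a count query. This is a minimal sketch (function names and the choice of epsilon are mine for illustration), assuming a count query, which has sensitivity 1: adding or removing one record changes the result by at most 1, so Laplace noise with scale 1/epsilon suffices.

```python
import math
import random

def laplace_sample(scale: float) -> float:
    # Inverse-CDF sampling from Laplace(0, scale)
    u = random.random() - 0.5
    return -scale * math.copysign(1.0, u) * math.log(1 - 2 * abs(u))

def dp_count(records, predicate, epsilon: float = 1.0) -> float:
    """Release a count query with epsilon-differential privacy.

    Smaller epsilon = more noise = stronger privacy. The noisy result
    limits how much any single individual's presence can be inferred.
    """
    true_count = sum(1 for r in records if predicate(r))
    return true_count + laplace_sample(1.0 / epsilon)

ages = list(range(100))
noisy = dp_count(ages, lambda a: a >= 50, epsilon=0.5)
print(noisy)  # close to 50, but perturbed to protect individuals
```

The privacy/utility trade-off is governed entirely by epsilon: auditors reviewing a data-collection pipeline should ask what epsilon (if any) is enforced and whether the query sensitivity was computed correctly.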

Possible Model Training Security Issues:

  • Adversarial Attacks: During training, models may be susceptible to adversarial examples designed to mislead AI predictions. These can be discovered through robustness testing, where the model is exposed to various manipulated inputs to assess its response.

  • Overfitting to Sensitive Data: If a model overfits its training data, it might inadvertently reveal sensitive information through its predictions. Techniques like model auditing and implementing generalization measures (e.g., regularization) can help identify and mitigate this issue.

Risks:

  • Compromised decision-making, leading to security vulnerabilities.

  • Unintentional data leakage, compromising user confidentiality.
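The robustness testing mentioned for adversarial attacks can be sketched as a stability check: perturb each input slightly and see whether the prediction flips. This is a deliberately crude stand-in (names and thresholds are my assumptions); real adversarial testing, e.g. FGSM, searches for *worst-case* perturbations rather than sampling random ones.

```python
import random

def robustness_rate(predict, inputs, epsilon: float = 0.1,
                    trials: int = 20) -> float:
    """Fraction of inputs whose prediction survives small perturbations.

    Inputs near a decision boundary flip easily under tiny changes,
    which is exactly the weakness adversarial examples exploit.
    """
    stable = 0
    for x in inputs:
        base = predict(x)
        if all(predict(x + random.uniform(-epsilon, epsilon)) == base
               for _ in range(trials)):
            stable += 1
    return stable / len(inputs)

# Toy 1-D classifier with a decision boundary at 0.5:
predict = lambda x: int(x > 0.5)
print(robustness_rate(predict, [0.0, 1.0]))   # far from boundary: stable
print(robustness_rate(predict, [0.49, 0.51])) # near boundary: fragile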

Possible Model Validation Security Issues:

  • Insufficient Testing: Failing to test the model thoroughly against a wide range of scenarios can leave vulnerabilities undetected. These can be discovered through comprehensive testing, including stress and scenario-based tests, to evaluate the model's performance across diverse conditions.

  • Bias and Fairness: Models might exhibit biased behaviour if not properly validated for fairness, which can be discovered through fairness assessments and bias mitigation techniques.

Risks:

  • Inadequate model performance under unexpected conditions, potentially endangering users.

  • Ethical and legal issues from biased decision-making.
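The fairness assessment mentioned above can be quantified with a simple metric such as the demographic parity gap: the difference in positive-prediction rates between two groups. This is a minimal sketch (the function name and the two-group restriction are my assumptions); production fairness audits consider many metrics, e.g. equalized odds, beyond this one.

```python
def demographic_parity_gap(predictions, groups) -> float:
    """Absolute difference in positive-prediction rate between two groups.

    predictions: iterable of 0/1 model outputs
    groups: parallel iterable of exactly two distinct group labels
    A gap near 0 suggests similar treatment on this metric; a large
    gap is a signal to investigate bias before deployment.
    """
    rates = {}
    for g in set(groups):
        outcomes = [p for p, gg in zip(predictions, groups) if gg == g]
        rates[g] = sum(outcomes) / len(outcomes)
    a, b = rates.values()  # assumes exactly two groups
    return abs(a - b)

preds  = [1, 1, 0, 0, 1, 0, 0, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]
print(demographic_parity_gap(preds, groups))  # 0.5 vs 0.25 -> gap 0.25
```

A validation gate that fails the build when the gap exceeds an agreed threshold is one practical way to turn a fairness assessment into an enforceable control.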

(WIP) How can you find the vulnerabilities in the training parts?

  • Training cycle frequency and how data is provided into those cycles
  • Feedback mechanisms
  • Platform-related issues
  • Privacy matters
  • Synthetic vs. real data


Last updated 1 year ago
