Data Lineage Tracking for AI: Complete Guide | Quiz

By Eyal Doron / December 6, 2025 / 1 minute of reading

1 / 8

1. Why does the article say lineage adds minimal overhead despite concerns?

1. Lineage requires no resources at all

2. Async metadata capture and proper tooling minimize impact while missing lineage costs far exceed implementation

3. Overhead concerns only apply to real-time systems

4. Only large enterprises need to worry about overhead

2 / 8

2. Why is the misconception that we can add lineage later dangerous?

1. Lineage can easily be added at any time

2. You cannot reconstruct transformation history from final outputs so lineage must be built from the start

3. Final outputs contain all transformation history

4. Retrofitting lineage takes only a few hours

3 / 8

3. What does the EU AI Act require regarding training data according to the article?

1. Training data documentation for high-risk systems and demonstrable traceability requirements

2. Documentation is optional for all risk levels

3. No documentation is required for any AI systems

4. Only the model output needs to be documented

4 / 8

4. How does lineage support GDPR right to erasure according to the article?

1. Erasure only requires deleting the original source data

2. Lineage shows which models were trained on a person's data enabling accurate deletion compliance

3. Lineage automatically deletes data when requested

4. GDPR does not apply to AI training data

5 / 8

5. What is the critical link for backward lineage according to the article?

1. Database foreign keys

2. Network connection between servers

3. Model-to-data linkage connecting each trained model to its training dataset versions

4. API authentication tokens

6 / 8

6. What metadata should be captured during the data collection stage?

1. Only metadata required by the AI model

2. Source system identification – collection timestamps – consent and permission metadata

3. Just the database connection string

4. Only the file size and format

7 / 8

7. Why does feature engineering obscure data origins according to the article?

1. Features are stored in different databases than source data

2. Feature engineering deletes the original data

3. Derived features like ratios and aggregations create indirect connections to dozens of underlying data points

4. Engineering transforms data into unreadable formats

8 / 8

8. What three critical questions does lineage answer according to the article?

1. How much – how fast – how accurate

2. Who accessed – when accessed – why accessed

3. What data – what transformations – what model version

4. Where stored – when backed up – who owns it

Your score is

The average score is 0%

Data Lineage Tracking for AI: Complete Guide | Quiz

🔐 The AI Security Manager's Newsletter

About The Author

Eyal Doron

Leave a Comment Cancel Reply

🔐 The AI Security Manager's Newsletter

About The Author

Eyal Doron

Related Posts

Leave a Comment Cancel Reply