Data Lineage Tracking for AI: Complete Guide | Quiz

By Eyal Doron / December 6, 2025 / 1 minute of reading

1 / 8

1. Why is the misconception that we can add lineage later dangerous?

1. Lineage can easily be added at any time

2. You cannot reconstruct transformation history from final outputs so lineage must be built from the start

3. Final outputs contain all transformation history

4. Retrofitting lineage takes only a few hours

2 / 8

2. What does the EU AI Act require regarding training data according to the article?

1. Documentation is optional for all risk levels

2. No documentation is required for any AI systems

3. Training data documentation for high-risk systems and demonstrable traceability requirements

4. Only the model output needs to be documented

3 / 8

3. How does lineage support GDPR right to erasure according to the article?

1. Erasure only requires deleting the original source data

2. GDPR does not apply to AI training data

3. Lineage shows which models were trained on a person's data enabling accurate deletion compliance

4. Lineage automatically deletes data when requested

4 / 8

4. Why is transformation code versioning essential according to the article?

1. Capturing Git hash lets you know exactly which code version processed the data

2. It is only needed for compliance audits

3. It makes the code run faster

4. It reduces storage costs

5 / 8

5. What metadata should be captured during the data collection stage?

1. Only metadata required by the AI model

2. Just the database connection string

3. Source system identification – collection timestamps – consent and permission metadata

4. Only the file size and format

6 / 8

6. Why does feature engineering obscure data origins according to the article?

1. Feature engineering deletes the original data

2. Engineering transforms data into unreadable formats

3. Derived features like ratios and aggregations create indirect connections to dozens of underlying data points

4. Features are stored in different databases than source data

7 / 8

7. What three critical questions does lineage answer according to the article?

1. What data – what transformations – what model version

2. Who accessed – when accessed – why accessed

3. Where stored – when backed up – who owns it

4. How much – how fast – how accurate

8 / 8

8. According to the article – what analogy best describes data lineage for AI?

1. A backup system that stores copies of all data

2. A family tree for your data showing origin and transformations and destination

3. A firewall that protects data from unauthorized access

4. An encryption system that secures data at rest

Your score is

The average score is 0%

Data Lineage Tracking for AI: Complete Guide | Quiz

🔐 The AI Security Manager's Newsletter

About The Author

Eyal Doron

Leave a Comment Cancel Reply

🔐 The AI Security Manager's Newsletter

About The Author

Eyal Doron

Related Posts

Leave a Comment Cancel Reply