Process-Driven Modelling of Media Forensic Investigations-Considerations on the Example of DeepFake Detection
Abstract
:1. Introduction
- 1.
- Development of standards, technical guidelines, test criteria and test methods: Currently, there exist no such standards that are sufficiently suitable for assessing the security and reliability of AI systems for critical contexts (such as health care, finance health care, finance, etc.). There is also a lack of security benchmarks for less critical applications (with a few exceptions).
- 2.
- Research effective countermeasures against AI-specific attacks: The existing measures for such attacks are often insufficient. In order to ensure a secure and robust operation of AI systems, further countermeasures must be researched.
- 3.
- Research into methods of transparency and explainability: The often inadequate explainability of AI systems has a significant influence on their Information Technology (IT) security and causes a lack of acceptance of the systems.
- The need for modelling forensic processes is reasoned upon.
- A concept for modelling media forensic investigation pipelines is derived from established guidelines.
- The applicability of such modelling is illustrated on the example of a media forensic investigation pipeline focusing on the detection of DeepFake videos. It is important to already mention at this point, that the DeepFake detectors, test criteria and test methods used in this paper are used for illustrative purposes on the processes and are not claiming to represent the state-of-the-art in detector research.
- The benefits of such a planned realisation of AI-based investigation methods are discussed.
2. State of the Art on Forensic Process Modelling for Media Forensics and DeepFake Detection
- Lawfulness: “refers to the need […] to adopt adequate, accessible, and foreseeable laws with sufficient precision and sufficient safeguards whenever the use of the detection technology, […], could interfere with fundamental rights and freedoms”.
- Fairness: “points to the need for being transparent about the use of the technology. Furthermore, it is obvious that the use of the detection methods should be restricted to well-defined legitimate purposes, […]”.
2.1. Forensic Process Modelling for Media Forensics
2.1.1. Forensic Process Modelling Requirements and Best Practices (US Perspective)
- Qualification of a witness as expert: First, a witness has to qualify as an expert. The conclusion of this process is that the presiding judge decides whether the witness may offer opinion testimony as an expert.
- Type of knowledge considered: The first seven words of FRE rule 702 specify different types of knowledge (e.g., scientific, technical or other specialised knowledge) that an expert can offer.
- Who is addressed by the expert: Basically, there are two entities the expert has to convince. First, the judge, to get admitted in pre-trial hearings, and second the ‘fact finder’ (the “trier of fact” in FRE rule 702 [10], either a jury in normal cases or a judge in non-jury trials) at the trial itself.
- Qualification: Any expert has to testify upon the five criteria listed in FRE rule 702 “knowledge, skill, experience, training, or education” [10]. This information helps the judge to decide whether an expert can be admitted to trial in a specific case and helps the ‘fact finder’ (i.e., usually the jury) to assign corresponding weights to each expert’s testimony in the decision process.
“whether the expert’s technique or theory can be or has been tested – that is, whether the expert’s theory can be challenged in some objective sense, or whether it is instead simply a subjective, conclusory approach that cannot reasonably be assessed for reliability”;
“whether the technique or theory has been subject to peer review and publication”;
“the known or potential rate of error of the technique or theory when applied”;
“the existence and maintenance of standards and controls”;
“whether the technique or theory has been generally accepted in the scientific community”.
“Strengthen scientific foundations of digital/multimedia evidence by developing systematic and coherent methods for studying the principles of digital/multimedia evidence to assess the causes and meaning of traces in the context of forensic questions, as well as any associated probabilities.”
“Assess ways to mitigate cognitive bias in cases that require an understanding of the context of traces in order to analyze digital/multimedia evidence, […]”
“Establish effective ways to evaluate and express probative value of digital/multimedia traces for source level and activity level conclusions. This includes studying how quantitative evaluation of digital/multimedia evidence can be constructed for different forensic questions, […] as well as studying how such evaluative results can be communicated to decision-makers.”
2.1.2. The German Perspective
2.2. (Brief) Summary on the Domains of DeepFake Generation and Detection
2.2.1. DeepFake Use Cases
2.2.2. DeepFake Detection
3. Related Work and the Derived Challenge for This Paper
3.1. The Data-Centric Examination Approach (DCEA)
3.2. Model Adaptation for Media Forensic Tasks
3.3. The Challenge Addressed in This Paper
4. Materials & Methods for the Design of a Process-Driven Investigation Model for DeepFake Detection
- Organisational: Specifying the method (as an investigation workflow) and establishing its constraints, limitations and potential errors attached to the method and/or its application.
- Technical: Buying and installation of the investigation environment (e.g., forensic workstations) and all required infrastructure (including software such as police casework systems as well as a suitable chain of custody realisation for digital assets).
- Personnel: Hiring, training and (re-)certification of experts for applying the method.
4.1. Modelling of Operator Units
4.2. Orchestration of Operators into an Investigation Context
4.3. Evaluation Best Practices and Publicly Available Benchmarking Data Sets
4.3.1. Evaluation Best Practices
- Very thoroughly benchmark under different training and evaluation scenarios (see [4]) the individual expert systems (here detectors) to be used in the fusion to precisely establish their requirements and capabilities as well as the error rates attached.
- Benchmark different fusion schemes under different training and evaluation scenarios (see [40]) and establish the impact of different weighting strategies onto the (detection) performance and error patterns.
- Consider decision confidences (where available) into the opinion forming.
- Allow for auditability as well as human oversight for the entire process.
4.3.2. Publicly Available (Benchmarking) DeepFake Data Sets
5. Application of the Updated Process Modelling to Describe a Fusion-Based DeepFake Detector
5.1. Templating (In SP) the Empirical Investigations for This Paper
5.1.1. Data Sets Used for Training and Benchmarking
5.1.2. Pre-Existing Detectors Re-Used in This Paper
5.1.3. Detectors Newly Implemented for This Paper
5.1.4. Fusion Operators and Weight Estimation
5.2. Instantiation of the Pipeline for the Evaluations in This Paper
5.2.1. Data Sets Used for Evaluation
5.2.2. Single Detector Evaluation Results
5.2.3. Results of the Fusion-Based Detection
6. Results
6.1. Experimental Evaluation Results and Comparison with the SOTA/Related Work
- The comparison of the investigation results and the differences experienced when looking at the performances on the TIMIT-DF and Celeb-DF data sets indicate a sensitivity of trained detection approaches to specific DeepFake generation methods. In consequence two alternative strategies for compensating this sensitivity should be explored: Generalisation or specialisation of the training scenario for detectors. For the first alternative, training sets with large heterogeneous DeepFake parts would be required, potentially resulting in models with a high false positive rate due to the fact that the model component(s) characterising the DeepFake class are very dispersed in the feature space. For the second alternative, targeted training for the different DeepFake generation would be required, effectively transforming the task into an n-class problem.
- Extensive benchmarking of detectors is required for any application of forensic methods. What is true for single detectors, becomes even more relevant when combining single expert systems into a fusion approach. The practical evaluations summarised in Section 5.2 above show how adding two detectors, which are performing individually better than the probability of guessing correctly (which would be ), negatively impairs a fusion outcome. What has not been reflected upon in the discussions made in Section 5 is that the question of fairness and bias are also becoming much more complex in the context of fusion: Out of the five detectors used within this paper, three are concentrating on the eye regions. This effectively leverages the weight estimation for fusion weights, which were made under the implicit assumption of the independence of involved detectors.
6.2. Lessons Learned during the Templating and Instantiating of the Pipeline in SP/OP
7. Conclusions and Discussion
8. Future Work
8.1. Extending the Presented Modelling and Evaluation Work
8.2. Demystifying Machine Learning and AI Systems
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Bundesamt für Sicherheit in der Informationstechnik (BSI). Sicherer, Robuster und nachvollziehbarer Einsatz von KI-Probleme, Maßnahmen und Handlungs-Bedarfe; BSI: Bonn, Germany, 2021. [Google Scholar]
- Champod, C.; Vuille, J. Scientific Evidence in Europe-Admissibility, Evaluation and Equality of Arms. Int. Comment. Evid. 2011, 9. [Google Scholar] [CrossRef] [Green Version]
- BSI. Leitfaden IT-Forensik; German Federal Office for Information Security: Bonn, Germany, 2011. [Google Scholar]
- Siegel, D.; Kraetzer, C.; Seidlitz, S.; Dittmann, J. Media Forensics Considerations on DeepFake Detection with Hand-Crafted Features. J. Imaging 2021, 7, 108. [Google Scholar] [CrossRef]
- Siegel, D.; Krätzer, C.; Seidlitz, S.; Dittmann, J. Forensic Data Model for Artificial Intelligence based Media Forensics-Illustrated on the Example of DeepFake Detection. In Media Watermarking, Security, and Forensics 2022-Electronic Imaging 2022; Alattar, A., Nasir Memon, G.S., Eds.; Society for Imaging Science and Technology IS&T: Springfield, VA, USA, 2022. [Google Scholar]
- Rathgeb, C.; Tolosana, R.; Vera-Rodriguez, R.; Busch, C. (Eds.) Handbook of Digital Face Manipulation and Detection From DeepFakes to Morphing Attacks; Springer: Berlin/Heidelberg, Germany, 2022. [Google Scholar]
- Ho, A.T.S.; Li, S. Handbook of Digital Forensics of Multimedia Data and Devices; Anthony, T.S., Li, S., Eds.; Department of Computing and Surrey Centre for Cyber Security (SCCS), University of Surrey: Guildford, UK; Wiley/IEEE Press: Hoboken, NJ, USA, 2015. [Google Scholar]
- U.S. Congress. Federal Rules of Evidence; Amended by the United States Supreme Court Apr. 26, 2011, eff. Dec. 1, 2011; U.S. Congress: Washington, DC, USA, 2011.
- SWGFAST. The Fingerprint Sourcebook; Scientific Working Group on Friction Ridge Analysis, Study and Technology (SWGFAST); National Institute of Justice, U.S. Department of Justice: Gaithersburg, MD, USA, 2011. [Google Scholar]
- LLI. Federal Rules of Evidence-FRE702; Legal Information Institute, Cornell Law School (LLI): Ithaca, NY, USA, 2010. [Google Scholar]
- LLI. Federal Rules of Evidence-Notes on FRE702; Legal Information Institute, Cornell Law School (LLI): Ithaca, NY, USA, 2010. [Google Scholar]
- Krätzer, C. Statistical Pattern Recognition for Audio-Forensics-Empirical Investigations on the Application Scenarios Audio Steganalysis and Microphone Forensics. Ph.D. Thesis, Otto-von-Guericke-Universität Magdeburg, Magdeburg, Germany, 2013. [Google Scholar]
- USC. United States Court (USC) 509 U.S. 579; Daubert v. Merrell Dow Pharmaceuticals, Inc.: Washington, DC, USA, 1993. [Google Scholar]
- USCA. United States Court of Appeals (USCA), Ninth Circuit. No. 90–55397; Argued and Submitted March 22, 1994. Decided January 4, 1995, 1995. Daubert, William and Joyce Daubert, individually and as Guardians Ad Litem for Jason Daubert, (a minor); Anita De Young, individually, and as Guardian Ad Litem for Eric Schuller, Plaintiffs-Appellants, vs. Merrell Dow Pharmaceuticals, Inc., a Delaware corporation, Defendant-Appellee; USCA: San Francisco, CA, USA, 1995. [Google Scholar]
- U.S. Congress. Frye v. United States, 293 F. 1013 (D.C. Cir.); U.S. Congress: Washington, DC, USA, 1923.
- Meyers, M.; Rogers, M. Computer Forensics: The Need for Standardization and Certification. Int. J. Digit. Evid. 2004, 3, 1–11. [Google Scholar]
- Nelson, B.; Phillips, A.; Steuart, C. Guide to Computer Forensics and Investigations, 4th ed.; Course Technology: Boston, MA, USA, 2010. [Google Scholar]
- Bijhold, J.; Ruifrok, A.; Jessen, M.; Geradts, Z.; Ehrhardt, S.; Alberink, I. Forensic audio and Visual Evidence 2004–2007: A Review. In Proceedings of the 15th INTERPOL Forensic Science Symposium, Lyon, France, 23–26 October 2007. [Google Scholar]
- Daeid, N.N.; Houck, M. (Eds.) Interpol’s Forensic Science Review; Taylor & Francis Inc.: Abingdon, UK, 2010. [Google Scholar]
- Ashcroft, J.; Daniels, D.J.; Hart, S.V. Forensic Examination of Digital Evidence: A Guide for Law Enforcement; U.S. Department of Justice-National Institute of Justice: Washington, DC, USA, 2004. [Google Scholar]
- Casey, E. Digital Evidence and Computer Crime: Forensic Science, Computers, and the Internet; Academic Press: Cambridge, MA, USA, 2011. [Google Scholar]
- Bartholomew, P. Seize First, Search Later: The Hunt for Digital Evidence. Touro Law Rev. 2014, 30, 1027–1052. [Google Scholar]
- Daniel, L.E.; Daniel, L.E. Digital Forensics for Legal Professionals: Understanding Digital Evidence from the Warrant to the Courtroom; Syngress: Washington, DC, USA, 2015. [Google Scholar]
- Pollit, M.; Casey, E.; Jaquet-Chiffelle, D.O.; Gladyshev, P. A Framework for Harmonizing Forensic Science Practices and Digital/Multimedia (OSAC Technical Series Publication 0002R1); Organization of Scientific Area Committees (OSAC): Gaithersburg, MD, USA, 2019. [Google Scholar]
- Kiltz, S. Data-Centric Examination Approach (DCEA) for a Qualitative Determination of Error, Loss and Uncertainty in Digital and Digitised Forensics. Ph.D. Thesis, Otto-von-Guericke-Universität Magdeburg, Fakultät für Informatik, Magdeburg, Deutschland, 2020. [Google Scholar]
- Kiltz, S.; Dittmann, J.; Vielhauer, C. Supporting Forensic Design-A Course Profile to Teach Forensics. In Proceedings of the 2015 Ninth International Conference on IT Security Incident Management and IT Forensics, Magdeburg, Germany, 18–20 May 2015; pp. 85–95. [Google Scholar]
- Flaglien, A.; Sunde, I.M.; Dilijonaite, A.; Hamm, J.; Sandvik, J.P.; Bjelland, P.; Franke, K.; Axelsson, S. Digital Forensics; John Wiley & Sons, Ltd.: Hoboken, NJ, USA, 2017. [Google Scholar]
- Mirsky, Y.; Lee, W. The Creation and Detection of Deepfakes: A Survey. ACM Comput. Surv. 2022, 54, 1–41. [Google Scholar] [CrossRef]
- Di Filippo, M.; Froede, S. Deep(C)Phishing: Next Level Vishing & Phishing; 18. Deutscher IT-Sicherheitskongress 1.-2. Februar 2022-DIGITAL; Bundesamt für Sicherheit in der Informationstechnik (BSI): Bonn, Germany, 2022. [Google Scholar]
- Nguyen, T.T.; Nguyen, C.M.; Nguyen, D.T.; Nguyen, D.T.; Nahavandi, S. Deep Learning for Deepfakes Creation and Detection: A Survey. arXiv 2019, arXiv:1909.11573. [Google Scholar] [CrossRef]
- Li, Y.; Chang, M.; Lyu, S. In Ictu Oculi: Exposing AI Generated Fake Face Videos by Detecting Eye Blinking. arXiv 2018, arXiv:1806.02877. [Google Scholar]
- McCloskey, S.; Albright, M. Detecting GAN-generated Imagery using Color Cues. arXiv 2018, arXiv:1812.08247. [Google Scholar]
- Agarwal, S.; Farid, H.; Gu, Y.; He, M.; Nagano, K.; Li, H. Protecting World Leaders Against Deep Fakes. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, CVPR Workshops 2019, Long Beach, CA, USA, 16–20 June 2019; pp. 38–45. [Google Scholar]
- Yang, X.; Li, Y.; Lyu, S. Exposing Deep Fakes Using Inconsistent Head Poses. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2019, Brighton, UK, 12–17 May 2019; pp. 8261–8265. [Google Scholar] [CrossRef] [Green Version]
- Jung, T.; Kim, S.; Kim, K. DeepVision: Deepfakes Detection Using Human Eye Blinking Pattern. IEEE Access 2020, 8, 83144–83154. [Google Scholar] [CrossRef]
- Soukupová, T.; Cech, J. Real-Time Eye Blink Detection Using Facial Landmarks. In Proceedings of the 21st Computer Vision Winter Workshop, Rimske Toplice, Slovenia, 3–5 February 2016. [Google Scholar]
- Altschaffel, R. Computer Forensics in Cyber-Physical Systems: Applying Existing Forensic Knowledge and Procedures from Classical IT to Automation and Automotive. Ph.D. Thesis, Otto-von-Guericke-Universität Magdeburg, Fakultät für Informatik, Magdeburg, Germany, 2020. [Google Scholar]
- Zhang, C.; Liu, C.; Zhang, X.; Almpanidis, G. An up-to-date comparison of state-of-the-art classification algorithms. Expert Syst. Appl. 2017, 82, 128–150. [Google Scholar] [CrossRef]
- Li, Y.; Yang, X.; Sun, P.; Qi, H.; Lyu, S. Celeb-DF: A Large-Scale Challenging Dataset for DeepFake Forensics. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, 13–19 June 2020; pp. 3204–3213. [Google Scholar] [CrossRef]
- Kraetzer, C.; Makrushin, A.; Dittmann, J.; Hildebrandt, M. Potential advantages and limitations of using information fusion in media forensics a discussion on the example of detecting face morphing attacks. EURASIP J. Inf. Secur. 2021, 2021, 9. [Google Scholar] [CrossRef]
- European Commission. On Artificial Intelligence-A European Approach to Excellence and Trust. COM(2020) 65 Final. 2020. Available online: https://ec.europa.eu/info/sites/default/files/commission-white-paper-artificial-intelligence-feb2020_en.pdf (accessed on 14 September 2021).
- European Commission. Proposal for a Regulation of the European Parliament and of the Council Laying down Harmonised Rules on Artificial Intelligence (Artificial Intelligence Act) and Amending Certain Union Legislative Acts. COM(2021) 206 Final. 2021. Available online: https://eur-lex.europa.eu/legal-content/EN/TXT/?qid=1623335154975&uri=CELEX%3A52021PC0206 (accessed on 14 September 2021).
- Cohen, J. A Coefficient of Agreement for Nominal Scales. Educ. Psychol. Meas. 1960, 20, 37–46. [Google Scholar] [CrossRef]
- Eugenio, B.D.; Glass, M. The Kappa Statistic: A Second Look. Comput. Linguist. 2004, 30, 95–101. [Google Scholar] [CrossRef]
- Frank, E.; Hall, M.A.; Witten, I.H. The WEKA Workbench. Online Appendix for Data Mining: Practical Machine Learning Tools and Techniques; Morgan Kaufmann: Burlington, MA, USA, 2016. [Google Scholar]
- Landis, J.; Koch, G. The measurement of observer agreement for categorical data. Biometrics 1977, 33, 159–174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Sim, J.; Wright, C.C. The Kappa Statistic in Reliability Studies: Use, Interpretation, and Sample Size Requirements. Phys. Ther. 2005, 85, 257–268. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Reichow, B. Evidence-Based Practices and Treatments for Children with Autism; Springer: Berlin/Heidelberg, Germany, 2011. [Google Scholar]
- Korshunov, P.; Marcel, S. DeepFakes: A New Threat to Face Recognition? Assessment and Detection. arXiv 2018, arXiv:1812.08685. [Google Scholar]
- Rössler, A.; Cozzolino, D.; Verdoliva, L.; Riess, C.; Thies, J.; Nießner, M. FaceForensics++: Learning to Detect Manipulated Facial Images. In Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, ICCV 2019, Seoul, Korea, 27 October–2 November 2019; pp. 1–11. [Google Scholar] [CrossRef] [Green Version]
- Dufour, N.; Gully, A. Contributing Data to Deepfake Detection Research. 2019. Available online: https://ai.googleblog.com/2019/09/contributing-data-to-deepfake-detection.html (accessed on 9 September 2021).
- Dolhansky, B.; Howes, R.; Pflaum, B.; Baram, N.; Canton Ferrer, C. The Deepfake Detection Challenge (DFDC) Preview Dataset. arXiv 2019, arXiv:1910.08854. [Google Scholar]
- Dolhansky, B.; Howes, R.; Pflaum, B.; Baram, N.; Canton Ferrer, C. The DeepFake Detection Challenge Dataset. arXiv 2020, arXiv:2006.07397. [Google Scholar]
- Jiang, L.; Li, R.; Wu, W.; Qian, C.; Loy, C.C. DeeperForensics-1.0: A Large-Scale Dataset for Real-World Face Forgery Detection. In Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, 13–19 June 2020; pp. 2886–2895. [Google Scholar] [CrossRef]
- Khalid, H.; Tariq, S.; Kim, M.; Woo, S.S. FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset. arXiv 2021, arXiv:2108.05080. [Google Scholar]
- Chung, J.S.; Nagrani, A.; Zisserman, A. VoxCeleb2: Deep Speaker Recognition. arXiv 2018, arXiv:1806.05622. [Google Scholar]
- Huang, J.; Wang, X.; Du, B.; Du, P.; Xu, C. DeepFake MNIST+: A DeepFake Facial Animation Dataset. arXiv 2021, arXiv:2108.07949. [Google Scholar]
- Zi, B.; Chang, M.; Chen, J.; Ma, X.; Jiang, Y.G. WildDeepfake: A Challenging Real-World Dataset for Deepfake Detection. In Proceedings of the 28th ACM International Conference on Multimedia, Seattle, WA, USA, 12–16 October 2020; Association for Computing Machinery: New York, NY, USA, 2020; pp. 2382–2390. [Google Scholar]
- Kwon, P.; You, J.; Nam, G.; Park, S.; Chae, G. KoDF: A Large-Scale Korean DeepFake Detection Dataset. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), Montreal, BC, Canada, 11–17 October 2021; pp. 10744–10753. [Google Scholar]
- Jain, A.; Korshunov, P.; Marcel, S. Improving Generalization of Deepfake Detection by Training for Attribution. In Proceedings of the International Workshop on Multimedia Signal Processing (MMSP), Tampere, Finland, 6–8 October 2021. [Google Scholar]
- McCool, C.; Marcel, S.; Hadid, A.; Pietikäinen, M.; Matejka, P.; Cernocký, J.H.; Poh, N.; Kittler, J.; Larcher, A.; Lévy, C.; et al. Bi-Modal Person Recognition on a Mobile Phone: Using Mobile Phone Data. In Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, Melbourne, VIC, Australia, 9–13 July 2012; pp. 635–640. [Google Scholar]
- Sanderson, C.; Lovell, B. Multi-Region Probabilistic Histograms for Robust and Scalable Identity Inference. Lect. Notes Comput. Sci. 2009, 5558, 199–208. [Google Scholar] [CrossRef] [Green Version]
- Korshunova, I.; Shi, W.; Dambre, J.; Theis, L. Fast Face-Swap Using Convolutional Neural Networks. In Proceedings of the IEEE International Conference on Computer Vision, ICCV 2017, Venice, Italy, 22–29 October 2017; pp. 3697–3705. [Google Scholar] [CrossRef] [Green Version]
- Rössler, A.; Cozzolino, D.; Verdoliva, L.; Riess, C.; Thies, J.; Nießner, M. FaceForensics: A Large-scale Video Dataset for Forgery Detection in Human Faces. arXiv 2018, arXiv:1803.09179. [Google Scholar]
- Zi, B.; Ma, X.; Chang, M.; Chen, J.; Jiang, Y.G. Deepfake In The Wild. Available online: https://github.com/KnightofDawn/deepfake_in_the_wild (accessed on 29 March 2022).
- King, D.E. Dlib-ml: A Machine Learning Toolkit. J. Mach. Learn. Res. 2009, 10, 1755–1758. [Google Scholar]
- John, G.H.; Langley, P. Estimating Continuous Distributions in Bayesian Classifiers. In Eleventh Conference on Uncertainty in Artificial Intelligence; Morgan Kaufmann: San Mateo, CA, USA, 1995; pp. 338–345. [Google Scholar]
- Weka documentation for NaiveBayes. Available online: https://weka.sourceforge.io/doc.dev/weka/classifiers/bayes/NaiveBayes.html (accessed on 25 March 2022).
- Chang, C.C.; Lin, C.J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol. 2011, 2, 27:1–27:27. [Google Scholar] [CrossRef]
- Landwehr, N.; Hall, M.; Frank, E. Logistic Model Trees. Mach. Learn. 2005, 59, 161–205. [Google Scholar] [CrossRef] [Green Version]
- Sumner, M.; Frank, E.; Hall, M. Speeding up Logistic Model Tree Induction. In Proceedings of the 9th European Conference on Principles and Practice of Knowledge Discovery in Databases, Porto, Portugal, 3–7 October 2005; pp. 675–683. [Google Scholar]
- Weka Documentation for SimpleLogistic. Available online: https://weka.sourceforge.io/doc.dev/weka/classifiers/functions/SimpleLogistic.html (accessed on 25 March 2022).
- Cohen, W.W. Fast Effective Rule Induction. In Proceedings of the Twelfth International Conference on Machine Learning, Morgan Kaufmann, Tahoe City, CA, USA, 9–12 July 1995; pp. 115–123. [Google Scholar]
- Weka Documentation for JRip. Available online: https://weka.sourceforge.io/doc.dev/weka/classifiers/rules/JRip.html (accessed on 25 March 2022).
- Quinlan, J.R. C4.5: Programs for Machine Learning; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 1993. [Google Scholar]
- Weka Documentation for J48. Available online: https://weka.sourceforge.io/doc.dev/weka/classifiers/trees/J48.html (accessed on 25 March 2022).
- Øe, H.S. Opinion of Advocate General-Case C-401/19 Republic of Poland vs European Parliament; Council of the European Union: Luxembourg, 2021. [Google Scholar]
- Kirby, B. Expert Witnesses Link Camera to Child Porn Found on Defendant Nathan Railey’s Laptop; Advance Local: New York, NY, USA, 2011. [Google Scholar]
- Goljan, M.; Fridrich, J.J.; Filler, T. Large scale test of sensor fingerprint camera identification. In Proceedings of the Media Forensics and Security I, part of the IS&T-SPIE Electronic Imaging Symposium, San Jose, CA, USA, 19 January 2009; Delp, E.J., Dittmann, J., Memon, N.D., Wong, P.W., Eds.; Volume 7254, p. 72540. [Google Scholar] [CrossRef] [Green Version]
- Hernandez, E.; Schwettmann, S.; Bau, D.; Bagashvili, T.; Torralba, A.; Andreas, J. Natural Language Descriptions of Deep Visual Features. arXiv 2022, arXiv:2201.11114. [Google Scholar]
Phases | Description (According to [25]) |
---|---|
Strategic preparation (SP) | Includes measures taken by the operator of an IT system and by the forensic examiners in order to support a forensic investigation prior to an incident |
Operational preparation (OP) | Includes measures of preparation for a forensic investigation after the detection of a suspected incident |
Data gathering (DG) | Includes measures to acquire and secure digital evidence |
Data investigation (DI) | Includes measures to evaluate and extract data for further investigation |
Data analysis (DA) | Includes measures for detailed analysis and correlation between digital evidence from various sources |
Documentation (DO) | Includes measures for the detailed documentation of the proceedings, also for the transformation into a different form of description for the report of the incident |
Data Type | Description |
---|---|
MFDT1 Digital input data | The initial media data considered for the investigation. |
MFDT2 Processed media data | Results of transformations to media data (e.g., greyscale conversion, cropping) |
MFDT3 Contextual data | Case specific information (e.g., for fairness evaluation) |
MFDT4 Parameter data | Contain settings and other parameter used for acquisition, investigation and analysis |
MFDT5 Examination data | Including the traces, patterns, anomalies, etc that lead to an examination result |
MFDT6 Model data | Describe trained model data (e.g., face detection and model classification data) |
MFDT7 Log data | Data, which is relevant for the administration of the system (e.g., system logs) |
MFDT8 Chain of custody & report data | Describe data used to ensure integrity and authenticity (e.g., hashes and time stamps) as well as the accompanying documentation for the final report. |
Kappa Value | Agreement According to [46] | Confidence Mapping Used Here |
---|---|---|
No agreement | Poor | |
Slight agreement | Poor to fair | |
Fair agreement | ||
Moderate agreement | Fair to good | |
Substantial agreement | ||
Almost perfect agreement | Good |
Data Set | # Individuals | # Real Video | # DeepFake Video |
---|---|---|---|
UADFV [34] | 49 | 49 | 49 |
TIMIT-DF [62,63] | 43 | 559 | 640 |
FaceForensics++ [50,64] | ? | 1000 | 4000 |
DFD [51] | 28 | 363 | 3068 |
Celeb-DF [39] | 59 | 890 | 5639 |
DFDC [53] | 960 | 23,654 | 104,500 |
DeeperForensics [54] | 100 | 50,000 | 10,000 |
WildDeepfake [58,65] | ? | 3805 | 3509 |
DeepFakeMnist+ [57] | ? | 10,000 | 10,000 |
FakeAVCeleb [55] | 490 | 20,000+ | 20,000+ |
KoDF [59] | 403 | 62,166 | 175,776 |
DF-Mobio [60] | 72 | 31,950 | 14,546 |
Detector | NaiveBayes | LibSVM | Simple Logistics | JRip | J48 |
---|---|---|---|---|---|
0.0695 | 0.3254 | 0.3508 | 0.3678 | 0.2966 | |
0.2162 | 0.0063 | 0.3275 | 0.2480 | 0.2273 |
Detector | on | on TIMIT-DF |
---|---|---|
0.0191 | 0.552 | |
0.0408 | 0.433 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Kraetzer, C.; Siegel, D.; Seidlitz, S.; Dittmann, J. Process-Driven Modelling of Media Forensic Investigations-Considerations on the Example of DeepFake Detection. Sensors 2022, 22, 3137. https://doi.org/10.3390/s22093137
Kraetzer C, Siegel D, Seidlitz S, Dittmann J. Process-Driven Modelling of Media Forensic Investigations-Considerations on the Example of DeepFake Detection. Sensors. 2022; 22(9):3137. https://doi.org/10.3390/s22093137
Chicago/Turabian StyleKraetzer, Christian, Dennis Siegel, Stefan Seidlitz, and Jana Dittmann. 2022. "Process-Driven Modelling of Media Forensic Investigations-Considerations on the Example of DeepFake Detection" Sensors 22, no. 9: 3137. https://doi.org/10.3390/s22093137
APA StyleKraetzer, C., Siegel, D., Seidlitz, S., & Dittmann, J. (2022). Process-Driven Modelling of Media Forensic Investigations-Considerations on the Example of DeepFake Detection. Sensors, 22(9), 3137. https://doi.org/10.3390/s22093137