*3.2. Genome Data Mining and Annotation of P450s*

Genome data mining of P450s and their annotation was carried out following the procedure described in the literature [54,55]. Briefly, individual cyanobacterial species proteomes were downloaded from KEGG and subjected to NCBI Batch Web CD-Search Tool analysis [74]. After analysis, the results were analyzed and proteins that belong to the P450 superfamily were selected. The selected proteins were subjected to BLAST analysis at http://bioshell.pl/p450/blast\_only.html as part of the P450

page at http://www.p450.unizulu.ac.za/. Based on the percentage identity to the named homolog P450s, the proteins were then annotated (assigning P450 family and subfamily), following the International P450 Nomenclature Committee rule, i.e., sequences with >40% identity were assigned to the same family as named homolog P450 and sequences with >55% identity to the same subfamily as named homolog P450 [60–62]. Proteins with less than 40% identity to a named homolog P450 were assigned to a new P450 family.
