Monarch geneset OGS2.0

DPOGS214960
Transcript: DPOGS214960-TA (3759 bp)
Protein: DPOGS214960-PA (1252 aa)
Genomic position: DPSCF300358 + 61996-120968
RNAseq coverage: 276x (Rank: top 39%)
Annotation
Heliconius: HMEL017365; E-value 0.0; identity 90.63%
Bombyx: BGIBMGA013969-TA; E-value 0.0; identity 73.62%
Drosophila: CG31619-PC; E-value 0.0; identity 41.59%
EBI UniRef50: UniRef50_D6WQN3; E-value 0.0; identity 47.78%; Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQN3_TRICA
NCBI RefSeq: XP_974990.2; E-value 0.0; identity 48.20%; PREDICTED: similar to papilin [Tribolium castaneum]
NCBI nr blastp: gi|189239451; E-value 0.0; identity 48.20%; PREDICTED: similar to papilin [Tribolium castaneum]
NCBI nr blastx: gi|189239451; E-value 0.0; identity 48.40%; PREDICTED: similar to papilin [Tribolium castaneum]
Group
Gene Ontology
GO:0008237; E-value 2e-19; metallopeptidase activity
GO:0008270; E-value 2e-19; zinc ion binding
GO:0005578; E-value 2e-19; proteinaceous extracellular matrix
GO:0008233; E-value 1.9e-08; peptidase activity
KEGG pathway 
InterPro domain
[50-68] IPR013273; E-value 2e-19; Peptidase M12B, ADAM-TS
[755-833] IPR013783; E-value 1.2e-15; Immunoglobulin-like fold
[37-89] IPR000884; E-value 2e-13; Thrombospondin, type 1 repeat
[755-820] IPR013098; E-value 2.6e-11; Immunoglobulin I-set
[1221-1251] IPR010909; E-value 1.9e-08; PLAC
Orthology group: MCL10970; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS214960-TA
ATGTTGATATGCTTGCAGATACCTGGTCGGTGTGCCCACAGACATGGTCATGACGTCACCACGGAACTGCCCGAACAATTGGACGCGCATCACGACAACACTTCACACCATGGAAAGGATCGCGGAGGATGGTCTCAATGGAGCGAGTGGTCACTGTGCTCTAGGACCTGTGACGGCGGTATCTCTCGTCAGCTAAGAACCTGTTCATCACCAGCTGGCTGCAGAGGGGAACCAGTTAGATACAAGATATGCAACATGCAGCCATGTCGCGGTAACGTTTCACTTCTAGAAGAGGATATATGGAGGGAACAAGAGTGCAGTTCTCATGATAACACTCCCTATGGAGGAGAATTGTTTCATTGGAGATCTCATAGAGACGACGATGAACCGTGCGCTCTAACGTGTCGTGGTACGCCACAGCATTCTGGACAAAGCCCTGAACCGACCATGGCTCTGGATGAGGAGGAACGAGTGGTGGTGGCAGTTCTAGCAGCCCGAGTATCTGACGGTACCCGATGTAGACCTGGGAGCTTAGATATGTGCATTGACGGCCGATGTCAGCGTGTTGGCTGTGACTTGCGAGTCGGTTCTACTCGTCGAGTCGATGAATGCGGTGTTTGTGGTGGGGACGGGTCATCATGTTCGAGACCAAGATACCACTGGCTTTCAACTCCAGGATCCTTGTGCTCCGCTACATGTGGTGGTGGTTATAAAATGTCTCTGGCCGTGTGTCGTGACCGACTAACTGGGTTGGATGCACCCGAGGAACTCTGCGATGGCTCCAGAAAACCAGCATCAGCTGTGGTGCGATGTAATACACATCCCTGTCCATTCAAGTGGTATGTGGGCGAGTGGTCATCTTGCAGTGTGACGTGTGGAGGAGGTGTACGGTCACGGCGAGTGTTGTGCGCGAGATCTGCCAATGTAACGAGGACAGACACTTACGATCCTGAGACAAGTTCTGAACCTGGTTGCTTAACACCTGCACCTCGTTCGACGCAGCCTTGTAATGATCACAGCTGTCCCACGTGGCTTGCAGGCGCCTGGTCAGGGTGTTCAGTATCCTGCGGTGAAGGTGTACAAGTTCGTGGGGTCGAATGTACCCCAGCTGGTGGCGGATGTGATCCAGCGACAAGACCAGAAATCTCCAGATCCTGCTCAACAGGCATAAACTGCCCGATATACAGAGAGCCTGAGGAGCCTGAGGACGACATAGAAGCACTACTCCCAGGTGTAGTTTATCACACCCAGCCCTTAATACAACAATATCCAGCTGCCGAAAGACTTGTAGGAGAACCAGACGTACCCGTTGAAGCTACTTACATAAAAGATGACGAATGGACACCGTGTAGCGTAACATGTGGCGAAGGATGGCGGAAAAAGGAGGTGCATTGCAAGATATTCTTAGAATTCAGTAGGACCATAGCAAAGTTACCTGATAGCAAATGCATGGGCCCAAAACCAACGGAAGAAACCGAGAGGTGTGTCATGGAACCTTGCTCTATGGCATACGGAACTTCGTTTGGAGATTCAAGTGCACCTGCTTATAACGGTGGAGACAGGTCGTTGATATTCGGTACATCGAGTAACATAAGAGTAGCACCAGGTTCACCAGGGAAGTCCTATTCTTGGAAGGAAAAGGGTTATACTAGCTGCAGTGCATCTTGTTTGAGCGGTGTACAGGAGCTTATAATACAATGCGTCCGAGACGAAGACGGCAAGAACGCGTCTCCATACATGTGCGATCCATTGACAAAGCCGGAGAATAGGGTGCGTACCTGCAACGACCACCCGTGTCCGCCAAGGTGGAACTACACGGAATTTTCTCAATGCACCAAGTCGTGCGGAATCGGCATTCAAACTCGCGAAGTCACCTGTATCCATGAGGTAACGAGAGGTGGTACAAATACCGTGGTGGTACCAAACAGTATGTGTCCTCAACCTCCACCACCTGACCGTCAGTACTGTAACGTACTTGACTGTCCTGTCAGGTGGCATGC
CGGCGACTGGTCCAAATGCTCCAAGACTTGTGGAGGAGGAGTCAAGCAGAGAGAAGTGATATGCAAACAAATAATGGCTCAGTCGCACGTAGTCGAGCGACCGTCGTCTCAATGCAGTTCACCGAGACCTGCGACAACAAAGTCGTGCAACAGTCGCCCGTGTCTGTTGGATACCTCGTCGCCGGAAATATCATTGGCAAACTCCTCATACATACAGCATGATCCGAAAAAGAAAAAGGTGACAGTAAAAGTGGGGGGTTCAGCGACGATATTTTACGGGACACAGGTGAAGATAAAATGTCCGGTCAAAGGTTACAACAGAACCAAAATACAGTGGGCCAAAGATCATCAGATTATAACCAAGTCGAAGAAATACAAGATATCTAAAAAGGGAGCTCTCCGTATAACTTCTCTTTCCCTCCGCGACCATGGAGTCTATACTTGCGTGGCGGGAAGGTCAAGCGCAAACCTGACCTTACTTGTGAAACCTCGCCCGGGTGAATTCCCATCCAGCGAGGAAATTGAAAGACATAAGGCCTTGGACGAACCTTCCTCACCACTTTCAGACAGAGCGGATGGTAGATATCGAGCGATGGTAGGTGGTCGATCTGACGATCAGTCCCATGAACAGCGGCCACCAGACCAGAAAAAGAATTACAAGAGTCGACAGAAAGGCAAAATTGACAAAGTACGTGACGCGTTATATGGAAGTGCAACAACGAAAGCCTCACCAAGTTACTCTCAAGCCAGAGATATTTACGATGAAAATGAGATGCTCCCGCCGAAAAGTAACGGCGCCTCTCGTCTATTACCATGTCTGCACTATCTTGTTATGCAGCTTCAGACAACCGGAAATTCGAGGGGTCAAAGGATGGTTGATCCCATCATCATGTACCAGAATTACGGATCACCAACACAGGCGGAGGTCGTTAACCTTCAGGACAAACAAATCGTTTTCCCTTACGACGATGACTCGGACATCATAATAGTTAACGAAGATTACAACAAGAAAACATTTGAAAGTACTGACAAATCTGATATGATTGAAACGACTACAACATTAGAACCGCAAAAGATAACCGCAACCGATGTACATGAATATATGTGGACCACAACATTGTGGTCGACTTGTTCCGCCCCGTGTGGACAAAGTGGACATCAGATAAGAGGTGCTATTTGTCAACATAAAGTCCAGAACACTACAACATCGGTAGTGACAGATGAGTGTATATCACGTGGACTAACTGCGCCTTCAGTGATGCGTAATTGTGAGACTGATGGATGCGCAACTTGGAAGGCTGGCGACTGGTCTCCACCGAGATGCCTTCTCAGTGGAACAGCTATAATCCGTCGTCGAGTAGAATGTGTGAGCGATAACGGTACGCTAGTCTCAGACTCGGCGTGTGTATACAGCGAGCGACCCGAGCATTTGCGTCGCGTACAACCTTGTAGAGCAGTCTGGTCTGTGGGTCCCTGGAGCAAGTGCAAAGGCCCTTGCGGTGAGAGCAAACAGCACCGCGTGCTGCGCTGCGTGTGGCGAGCACCCTCCATACAAGGAAATACACGCACGAGAAGAGAAAGACCTGCCGCCGCCTGTGTACAAGAACGGCCTCCAGTTGCAAGGGATTGTAAGCAGAGTAACTGTGTCAGGGATGCTGTTTGCAGAGACACCTCACGTTTCTGTGAGAACGTCCGCGCCATGAATATGTGCGCGCTCCAGCGCTACCAGAGGCAGTGCTGCAAGACCTGCGAGGATTAA

Protein sequence:

>DPOGS214960-PA
MLICLQIPGRCAHRHGHDVTTELPEQLDAHHDNTSHHGKDRGGWSQWSEWSLCSRTCDGGISRQLRTCSSPAGCRGEPVRYKICNMQPCRGNVSLLEEDIWREQECSSHDNTPYGGELFHWRSHRDDDEPCALTCRGTPQHSGQSPEPTMALDEEERVVVAVLAARVSDGTRCRPGSLDMCIDGRCQRVGCDLRVGSTRRVDECGVCGGDGSSCSRPRYHWLSTPGSLCSATCGGGYKMSLAVCRDRLTGLDAPEELCDGSRKPASAVVRCNTHPCPFKWYVGEWSSCSVTCGGGVRSRRVLCARSANVTRTDTYDPETSSEPGCLTPAPRSTQPCNDHSCPTWLAGAWSGCSVSCGEGVQVRGVECTPAGGGCDPATRPEISRSCSTGINCPIYREPEEPEDDIEALLPGVVYHTQPLIQQYPAAERLVGEPDVPVEATYIKDDEWTPCSVTCGEGWRKKEVHCKIFLEFSRTIAKLPDSKCMGPKPTEETERCVMEPCSMAYGTSFGDSSAPAYNGGDRSLIFGTSSNIRVAPGSPGKSYSWKEKGYTSCSASCLSGVQELIIQCVRDEDGKNASPYMCDPLTKPENRVRTCNDHPCPPRWNYTEFSQCTKSCGIGIQTREVTCIHEVTRGGTNTVVVPNSMCPQPPPPDRQYCNVLDCPVRWHAGDWSKCSKTCGGGVKQREVICKQIMAQSHVVERPSSQCSSPRPATTKSCNSRPCLLDTSSPEISLANSSYIQHDPKKKKVTVKVGGSATIFYGTQVKIKCPVKGYNRTKIQWAKDHQIITKSKKYKISKKGALRITSLSLRDHGVYTCVAGRSSANLTLLVKPRPGEFPSSEEIERHKALDEPSSPLSDRADGRYRAMVGGRSDDQSHEQRPPDQKKNYKSRQKGKIDKVRDALYGSATTKASPSYSQARDIYDENEMLPPKSNGASRLLPCLHYLVMQLQTTGNSRGQRMVDPIIMYQNYGSPTQAEVVNLQDKQIVFPYDDDSDIIIVNEDYNKKTFESTDKSDMIETTTTLEPQKITATDVHEYMWTTTLWSTCSAPCGQSGHQIRGAICQHKVQNTTTSVVTDECISRGLTAPSVMRNCETDGCATWKAGDWSPPRCLLSGTAIIRRRVECVSDNGTLVSDSACVYSERPEHLRRVQPCRAVWSVGPWSKCKGPCGESKQHRVLRCVWRAPSIQGNTRTRRERPAAACVQERPPVARDCKQSNCVRDAVCRDTSRFCENVRAMNMCALQRYQRQCCKTCED-