Monarch geneset OGS2.0

DPOGS215553
TranscriptDPOGS215553-TA2304 bp
ProteinDPOGS215553-PA767 aa
Genomic positionDPSCF300129 + 666751-674133
RNAseq coverage778x (Rank: top 17%)
Annotation
HeliconiusHMEL0034360.075.28% 
BombyxBGIBMGA000826-TA0.073.40% 
DrosophilaTsf2-PA0.047.85% 
EBI UniRef50UniRef50_Q9VTZ50.047.85%LD22449p n=19 Tax=Endopterygota RepID=Q9VTZ5_DROME
NCBI RefSeqXP_001841784.10.048.92%lactotransferrin [Culex quinquefasciatus]
NCBI nr blastpgi|1700277980.048.92%lactotransferrin [Culex quinquefasciatus]
NCBI nr blastxgi|1700277980.048.66%lactotransferrin [Culex quinquefasciatus]
Group
Gene OntologyGO:00068794.8e-272cellular iron ion homeostasis
GO:00068264.8e-272iron ion transport
GO:00081994.8e-272ferric iron binding
GO:00055764.8e-272extracellular region
KEGG pathway 
InterPro domain[11-730] IPR0011564.8e-272Peptidase S60, transferrin lactoferrin
Orthology groupMCL11513 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215553-TA
ATGCAAGACAAATCGCTATTCGGCAAAGATTGGATAGAAATTCAATGCAAGAGGGCATTTGACACTGAAGAGTGTATGACATGGGTTGACAACCGAGTGGCATCTCTCCTGGCCCTGGATGCTGGGGAGGTCTATGTGGCTGGTCGTTACCATTCATTAGTGCCAATATTACAAGAATTATATGGCAGGAACGAACCATATCAATATACAGTGGCTGTAGTCAAAAAAGGCAGTCTGCTGGCTGTTCAGCCTGAATCTGGACTTCATGGCCTAAGAGGAGCCAGAGCCTGTTTCCCACTTGTCGGATCTCTAGCCGGCTGGGTCATGCCAATACATGTTCTCATGCAAGAAGGTGGTCTCAAAATAACAGATTGTAATAATCATGTGAAGTCAGCCGTTGAATACTTTGGGGAATCCTGTGCACCAAATTCATTGAAAGATATCTATAACCCCATAGGTGACAACCCTGATAAACTATGCAAACTATGTAGTGGTGAGGCAGGCATTCGTTGTACACTTGCGGATCCATATGCTGGTTACGAAGGGGCTCTTAAATGTTTGGTTGCCAATAATACTGGAGATATAGCATTTGTTAGAGACACTACTATACAACATGCATTGCTGTCCGGGAAGATATTGGGTGGTGTGACAGCGTCTAGCTTTGAGTTGATATGCCGTGATGGCTCACGAGCTGAGGTAACACAATGGGAACATTGTCATTGGGGAAGAGTCCCCGCTGATGCTATCGTAACCAGCAGTGCCGCTACCATCGCTCAGAGGACTAAATATCAAAATATTCTTATGAAAATACTTGAACTGTATGGAGAACCAAATCCAGAGAATCGGAATTCAAATAGAACTAACGATTTTTTGAACAATGATCCATATGTCTACAGCTCTACTACAGACAGAACGAAATACCAATATGACTATAATAGAGACAATTATAGGAATGGAAGTGCATCGTCACAACCTTTCCAACTATTTCTATCAAATGGCACAACGGATTTATTGGTACAGGACGCCACTATAAACTTTCGAGTGCTGAAGGAAAGTGAGCAAGTCGCCAAACACATATTGAACAACGAATTTGTTGGCGATCAAGCCGAACGAGCTGTGACCGGTATACGAGACTGTCCGGTGAAGCGGGCGATACTATGCGTCACCAGTGAAGCTGAAATGGAGAAATGTATTAAAATGAGGGTTGCCTTGAAAGCTGCGTTTATGTCTCCGACGTTCTCATGCTGGCGCGCCCACAGTACCCGCCACTGTGAGCGAGCGATCGCGGAAGGCACGTGCGACTTCGCCCTGTTCGACGCCGCTGACATGTTGCACGCCGCTTACAGACACCGACTGGTGCCGTTCATGCAGGAGGTGTATACAAGTGGTGATAACTGGTACTATGCTGTGGCGGTCGCTAAAGAACAGGATCCAGACACGGATTTGACCTACTTAAGAGGAAAAAATACATGTCACACCGGCATTGGCATGGCCGCTGGTTGGGTGTACCCTCTGGCTTACCTTATATCCAACGGATGGATTCGTCCGTACGGTTGTGACGGTGCTCAAGCGGCTGCCCAGTACTTCAGTAAGTCGTGTGCATCTGGTTCACTGTCTGCGGAGTACGTCGACGCCAATACAGTACCCCACGATAATCTCTGTCATTTATGTCATGGTGCCTCGTTTAGGCGTTGTCGTCGTGATGCCAACGAACCGTACTACGGGCACGTGGGTGCGCTACGGTGTATGGTGGAGGGAGGGGGGGATGTGGCGTTTGTTAGACACACAGCCCTCACTGAGGTCACCGGAGGTAGGAGACGAGAGTGGTGGGCGAGGGACCTGCTCCCTGATGACCTACAACTGCTGTGCCCGGACGGCACTCGTGAAAAAATGCACGAGTACAAGAAATGTAATCTCGGTAAGGTGCCGGGATCTGTTTTGATGGGGCGAGCGAACCACACTGAACTGGACACTTACTCCAATCTCATGGTGTACGCACAGCAATTATACGGCGCTACGATGACTGATGAATTCAGCTTTAGTATGTTTTACTCAATGCCGCCTTACGCCGATTTGATATTCAGCGATTCAGCCGTCCGTCTAAAGCCGCTTTCACACAAGATGCGTTCCGCTGAAATTATAGCGGGTAGAGCTTTGCCTCGAGCGGCTAGGATAGTGTCGTGTGACGCGCCACAGGCTTCGTATTATTTCGCATCAGATCCAGATTTCTTATCGAGTGCGTACAAGCAAGGTGTCATAGGACATCTGATCGCGTTTGCGATATTATTGATAGCATTATTATAA

Protein sequence:

>DPOGS215553-PA
MQDKSLFGKDWIEIQCKRAFDTEECMTWVDNRVASLLALDAGEVYVAGRYHSLVPILQELYGRNEPYQYTVAVVKKGSLLAVQPESGLHGLRGARACFPLVGSLAGWVMPIHVLMQEGGLKITDCNNHVKSAVEYFGESCAPNSLKDIYNPIGDNPDKLCKLCSGEAGIRCTLADPYAGYEGALKCLVANNTGDIAFVRDTTIQHALLSGKILGGVTASSFELICRDGSRAEVTQWEHCHWGRVPADAIVTSSAATIAQRTKYQNILMKILELYGEPNPENRNSNRTNDFLNNDPYVYSSTTDRTKYQYDYNRDNYRNGSASSQPFQLFLSNGTTDLLVQDATINFRVLKESEQVAKHILNNEFVGDQAERAVTGIRDCPVKRAILCVTSEAEMEKCIKMRVALKAAFMSPTFSCWRAHSTRHCERAIAEGTCDFALFDAADMLHAAYRHRLVPFMQEVYTSGDNWYYAVAVAKEQDPDTDLTYLRGKNTCHTGIGMAAGWVYPLAYLISNGWIRPYGCDGAQAAAQYFSKSCASGSLSAEYVDANTVPHDNLCHLCHGASFRRCRRDANEPYYGHVGALRCMVEGGGDVAFVRHTALTEVTGGRRREWWARDLLPDDLQLLCPDGTREKMHEYKKCNLGKVPGSVLMGRANHTELDTYSNLMVYAQQLYGATMTDEFSFSMFYSMPPYADLIFSDSAVRLKPLSHKMRSAEIIAGRALPRAARIVSCDAPQASYYFASDPDFLSSAYKQGVIGHLIAFAILLIALL-