Monarch geneset OGS2.0

DPOGS211528
TranscriptDPOGS211528-TA2103 bp
ProteinDPOGS211528-PA700 aa
Genomic positionDPSCF300354 + 271704-276790
RNAseq coverage885x (Rank: top 14%)
Annotation
HeliconiusHMEL0132210.057.89% 
BombyxBGIBMGA003841-TA6e-14961.06% 
DrosophilaTsf3-PA5e-12236.24% 
EBI UniRef50UniRef50_B4J5J22e-12037.31%GH20842 n=1 Tax=Drosophila grimshawi RepID=B4J5J2_DROGR
NCBI RefSeqXP_001663123.17e-12738.69%transferrin [Aedes aegypti]
NCBI nr blastpgi|1571340511e-12538.69%transferrin [Aedes aegypti]
NCBI nr blastxgi|910764422e-12638.12%PREDICTED: similar to transferrin [Tribolium castaneum]
Group
Gene OntologyGO:00068792.5e-137cellular iron ion homeostasis
GO:00068262.5e-137iron ion transport
GO:00081992.5e-137ferric iron binding
GO:00055762.5e-137extracellular region
KEGG pathway 
InterPro domain[1-645] IPR0155602.5e-137Transferrin precursor
[1-645] IPR0011562.5e-137Peptidase S60, transferrin lactoferrin
Orthology groupMCL15763 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211528-TA
ATGTTACGTCTGTGCGTCGTGGAAGGTCGCGGTCCGTTCAAACGGGGCGCAACTTTCTGTCCCATTCTGGAAGAGGAGAATTCTGGTGTGGAGTGTGTCCTGGGAGCTGATCAACTGGACTGCCTGCGTCGCATTAGTAAAGGCACGGTTGACTTCGGAGTGTTCAGCCCCGAGGATCTGATAGCGGCGCAGTGGGCCAACATTGACGTTCTAGTCACTAACGAGATTAGGCAGAGGCATAGACCATACGAACGGAGAATAGTGTCCGTAGTCAATCGTCGTATCCTGTTCGAGACCGATGCGTCTTTGTCAAATGTTCTCAGGAACAGTACTCTGTGCCACCCGGGCGTGGGAGTTGACGACCTCCGACCTATGTCAGATACACTCGCTGAGTACCTGGAGTCCCTCATCATCCCTCGCTCCTGTGAGCCCGAGCTGACGCTCACTGAGAACCACATCAAGGCTGTGTCCAGCTTCTTCTACAAGGCCTGTAAGGCCGGACCCTGGGTTCCAGACAAACAACGCGACGCGGCGCTAAAGAAGAAGTATCCAAATCTGTGTGGAGCGTGCGCGAGCCCGGATTGTTCTACAAACGACAAATACTGGGGCCCGCTTGGAGCCCTGCAGTGTTTGGGGGACGACGCGGGCGACGCCATGTGGGGGGAGATGGATGACGTCATGTACTTCTTTGGGGTGAACACATCGTCCCCGGCCGGTTCCGCTGCTAATATCGATTCCTTTGCCTACTTGTGTCGCGACGGGAGCTCCCAGCCCGTCGCCGGGAACATGGCACCCTGCGTGTGGCTCAACAGACCCTGGCCCGTCGTTATAGCGAAGCGGAAGGCGTCCGCGTCAGTGAGTTCCCTGGTGTCCTCGTTGAAGGAGGAGGACCTGTTCAGTAGCAACTGGCGCGAAGCCCTGGCCTCACTACTCGAGGTCCGCCGGGCCCCCGTCACAGCCCACATGAAGGCCCCCCTAGACTACCTCGCATCAGCTCGGGGCTTCAGGGAAGCTTACAGTCAGAGCGGATGTGATCCGCCGAGGCACATAACAATGTGTACGAATTCGTTGTTGGAGAAGAACAAGTGCGAGTGGCTGAGCGAGGCGGGCGCCGTGTACGGGGTGAAGCCTCCGCTGCAGTGCACCATGACGGCGGGGGCTGCGGACTGTCTGAGGAGCGTCAGGGACGGCGCTAGTGACGTCACTGTACAGCACAGCGACTGGCTGCCGGTGGGCAGCAGGTTTTTCGATTTGAAGCCTGTGCTATACGAAGTGACTCCTATAATGGAAAAAATGGACACAGTGATGGCCTACGTCAGACGCGACGCCAATATAAACGACATGGCGGAACTGCGCGGGAAAAGGGCCGCGTTCCCGAGCTACGACGGCGTCGCGTGGCATTCGGTGTTCAAATACATAGCGGATAAAGAGAACTACAACTGCTACGACGCCGTGCACGATTACTTCAGTGAAATCTGTGCACCGGGAGTCGAAGTCAGTGGATTCGAAGTGGATATTACGGATAGATATACGAGGAACTGCTACAAGGACGGGGACGAGGTGGTGAGGGGAGAGGTGGCGGCGCTGAGGTCGGTCGTGGAAGGGAAGAGTGATGTGGCGTTCATTAGTATGAGAACTGTTAGAATGTACCAAAGTAACCTAATAAACGAGCCGTGGTCTAACACCCTCGTGGACCTGAAGCCGGTCTGTCCCGAGGAGAACAGGAAATACTGTTACATATCCTGGTCGAACATCGGTCACATTTACGCCAAACGAAACATCACAGCTATGAGGAAACACGAAATAATCACGATGTTCACGAAGTTAGATCAATTGTTCGGCAAACATCAACCCTTCCATAGCGTCATGTTCACGATGTACGGGCCGTTTAATCACCAGACCGATGTCATATTCCACAACAACACTAAAAGCCTATCAACTGACAACTTATTTAAAACGCATCCATACGTCTCTATGCCGTTCAACTTTGACAGTACGTTTAATGACAGCGTCTCTCACACGGACATAACGAGCACAGCGTATAAAATGGCGCCAACAGTTGTATTGTTGGCGGCTATTTTAAGTATATTAGTCATAAATTAG

Protein sequence:

>DPOGS211528-PA
MLRLCVVEGRGPFKRGATFCPILEEENSGVECVLGADQLDCLRRISKGTVDFGVFSPEDLIAAQWANIDVLVTNEIRQRHRPYERRIVSVVNRRILFETDASLSNVLRNSTLCHPGVGVDDLRPMSDTLAEYLESLIIPRSCEPELTLTENHIKAVSSFFYKACKAGPWVPDKQRDAALKKKYPNLCGACASPDCSTNDKYWGPLGALQCLGDDAGDAMWGEMDDVMYFFGVNTSSPAGSAANIDSFAYLCRDGSSQPVAGNMAPCVWLNRPWPVVIAKRKASASVSSLVSSLKEEDLFSSNWREALASLLEVRRAPVTAHMKAPLDYLASARGFREAYSQSGCDPPRHITMCTNSLLEKNKCEWLSEAGAVYGVKPPLQCTMTAGAADCLRSVRDGASDVTVQHSDWLPVGSRFFDLKPVLYEVTPIMEKMDTVMAYVRRDANINDMAELRGKRAAFPSYDGVAWHSVFKYIADKENYNCYDAVHDYFSEICAPGVEVSGFEVDITDRYTRNCYKDGDEVVRGEVAALRSVVEGKSDVAFISMRTVRMYQSNLINEPWSNTLVDLKPVCPEENRKYCYISWSNIGHIYAKRNITAMRKHEIITMFTKLDQLFGKHQPFHSVMFTMYGPFNHQTDVIFHNNTKSLSTDNLFKTHPYVSMPFNFDSTFNDSVSHTDITSTAYKMAPTVVLLAAILSILVIN-