Monarch geneset OGS2.0

DPOGS215003
TranscriptDPOGS215003-TA1110 bp
ProteinDPOGS215003-PA369 aa
Genomic positionDPSCF300256 + 44002-48048
RNAseq coverage34x (Rank: top 74%)
Annotation
HeliconiusHMEL0132432e-8449.39% 
BombyxBGIBMGA012221-TA6e-10151.81% 
DrosophilaTsf1-PA4e-2625.07% 
EBI UniRef50UniRef50_Q7Q3282e-4632.30%AGAP011453-PA n=7 Tax=Culicidae RepID=Q7Q328_ANOGA
NCBI RefSeqXP_001869079.11e-4737.18%transferrin [Culex quinquefasciatus]
NCBI nr blastpgi|1700690213e-4637.18%transferrin [Culex quinquefasciatus]
NCBI nr blastxgi|1700690215e-4636.54%transferrin [Culex quinquefasciatus]
Group
Gene OntologyGO:00068791.2e-27cellular iron ion homeostasis
GO:00068261.2e-27iron ion transport
GO:00081991.2e-27ferric iron binding
GO:00055761.2e-27extracellular region
KEGG pathway 
InterPro domain[4-305] IPR0155601.2e-27Transferrin precursor
[4-305] IPR0011561.2e-27Peptidase S60, transferrin lactoferrin
Orthology groupMCL17560 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215003-TA
ATGGTTTGGCAGAGAGTTCTCTTTACGATCGCAGTGGCCGCAGGTGTGACGGCACAGAATGATGTCCGTGTTTGCGTGCCCAAGTTGTTCGCACCATACTGTGACAGCTTGCAGAGTCTTGGCAGTCCAGTCGTCTGTGATGGAGTGGAATCCAGTGTTGACTGCCTCACAAGACTGAATAGAGGGGCTTCAGACTTCGGAGTGTTCTCTGATGAGGATATGGTGTTGATGGCACACAAACAACCCGACAGGAACAGGGTCGTGGCGTCCGTTAGAGATGTTCTAAGAAAAGATGCCTACGCCTTCGAAGCGGTCGCTGTAGTCCCAGCGAGTCACGCTGGAGGTCTAGAGGGTCTCCGGGGTATGAAGTACTGTCACCCTGGTCTGGACGACTCGGAGCCAAGCTGGTCTCCACGAGTGCAGAAGACTTTGGAACGAGCTGCTGCCAAAATCGATAGCTGTCCTGATGCAGAAATGGAAGTGCAAACTCTGAGCTCCTTCTTCCACAGCGCTTGTAGACCTGGACCTTGGAGTGATGATGACAATGTTGATGCTGATCTTAAATCCCGATTCTCGAACCTGTGCGCTCTGTGTGGCCAGAACGCGAAGTGTTCCAAGTACACCATCAACATGGGCCCCAACATCAATAACGTTGACAACAACAACAGACACATCCAGGCCTTGGAGTGCATGAGGTCCAATGGGAACAACACTTTCGTTTACGCCGCCTGGCAACACGTCCGCACGTACTTTAACATACGCAATCCTCGTATCCGTGCGTCATACTCTCTCCTCTGTGAAGACGGCTCCCTCCGTCCACTGACCCTTGAAGCCACTATTTCCGACGTCTCTCCTTGTTCGTTCGTCAAACAGCCTTGGGGCGCTATTGTTGCTTCTTCACCGAGAGCAAACCAAGCGAGCAGTGCCATCAAGAGCTGGTGGCCTTCGGGCTCCAACCCTGGAGGCAACACTTGGCAGTCAGTGCTGTATACGGCTTTAATAGGAAGCTTCCCGAACGTTGTGTACTTCGAAGACGGTCTGCCTACACCAGGATCTTACATCCAAAGTGGCAACTTCACCTCGATTGATGACTCCTCATCTTGCATCTGA

Protein sequence:

>DPOGS215003-PA
MVWQRVLFTIAVAAGVTAQNDVRVCVPKLFAPYCDSLQSLGSPVVCDGVESSVDCLTRLNRGASDFGVFSDEDMVLMAHKQPDRNRVVASVRDVLRKDAYAFEAVAVVPASHAGGLEGLRGMKYCHPGLDDSEPSWSPRVQKTLERAAAKIDSCPDAEMEVQTLSSFFHSACRPGPWSDDDNVDADLKSRFSNLCALCGQNAKCSKYTINMGPNINNVDNNNRHIQALECMRSNGNNTFVYAAWQHVRTYFNIRNPRIRASYSLLCEDGSLRPLTLEATISDVSPCSFVKQPWGAIVASSPRANQASSAIKSWWPSGSNPGGNTWQSVLYTALIGSFPNVVYFEDGLPTPGSYIQSGNFTSIDDSSSCI-