Monarch geneset OGS2.0

DPOGS214999
Transcript: DPOGS214999-TA; 1557 bp
Protein: DPOGS214999-PA; 518 aa
Genomic position: DPSCF300256 - 31433-39457
RNAseq coverage: 398x (Rank: top 30%)
Annotation
Heliconius: HMEL013243; 4e-111; 54.55%
Bombyx: BGIBMGA012221-TA; 4e-135; 59.95%
Drosophila: Tsf3-PA; 9e-31; 27.73%
EBI UniRef50: UniRef50_Q7Q328; 4e-53; 32.02%; AGAP011453-PA n=7 Tax=Culicidae RepID=Q7Q328_ANOGA
NCBI RefSeq: XP_001869079.1; 3e-58; 37.37%; transferrin [Culex quinquefasciatus]
NCBI nr blastp: gi|170069021; 6e-57; 37.37%; transferrin [Culex quinquefasciatus]
NCBI nr blastx: gi|170069021; 9e-58; 37.37%; transferrin [Culex quinquefasciatus]
Group
Gene Ontology:
  GO:0006879; 5.4e-32; cellular iron ion homeostasis
  GO:0006826; 5.4e-32; iron ion transport
  GO:0008199; 5.4e-32; ferric iron binding
  GO:0005576; 5.4e-32; extracellular region
KEGG pathway:
InterPro domain:
  [85-513] IPR015560; 5.4e-32; Transferrin precursor
  [85-513] IPR001156; 5.4e-32; Peptidase S60, transferrin lactoferrin
Orthology group: MCL17560; Insect specific

Nucleotide sequence:

>DPOGS214999-TA
ATGGGTTGGAAGAAAATTCTCTTAGCGGTCACGCTGATAGCCGGCGTCACGGCACAGAACGAGTTTCGCGTATGCGTGCCACCATCGTTCGCCTCTCAATGTCAGGGCCTTCAGAGTCTTGGCAGTCCGATCATTTGTGATACAGTGGAATCCAGTACAAACCCGTGCCCTGAGGGACGCCCGTACAATCAACGCCTTATCACCTTATCAGCTTATCACACCTTTAAAACGAATTTCGAAAACGATAAAGATAGGTATCGAGCGATACAGCGGATCGCGCTCCGTAGTTTGTGTTTGGTGTTTGTGAAAATGGGTTGGAAGAAAATTCTCTTAGCGGTCACGCTGATAGCCGGCGTCACGGCACAGAACGAGTTTCGCGTATGCGTGCCGCCATCGTTCGCCTCTCAATGTCAGGGCCTTCAGAGTCTTGGCAGTCCGATCATTTGTGATACAGTGGAATCCAGGCTCGACTGTATCATGAGACTGAACAGGGGTGATTCAGATTTCGGAGTGTTCTCTGAAGAAGAAATGGTATTGATGGCCCACAACCAGCCCAACGACAACCGGGTCGTGGCTTCCATCAAGGACATCCTTAGCAATGGCTCCTACGCGTTCGAAGCGGTCGCTGTAGTCCCAGCGAGTCACACTGGAGGTCTAGAGGGTCTCCGAGGTATGAAGTACTGTCACCCTGGTCTGGATGAAACTGAGGCGCGCTGGTCTCCACGAGTGCTTAAGACTTTAGAACGAGCTGCAGCCAGAACCGACCGCTGCCCGGATATGGATACCAACGGAAAAACAGCTGAGGAAATTGAAGTCCAAACTCTGAACTCTTTCTTCGGCGCTGCTTGTAGACCTGGACCTTGGAGCGCTAACTCCTCGGTTGATGCTGACCTCAAATCCCGTTTCCCTAACCTGTGCTCTCTGTGCGGTTCCAACGGAGACTGCTCCAAATACTCCATCGACATGGGCCCGAACATCGCCAACGTTGACAACAACAACAGACACATCCAGGCCTTGGAGTGTATGAGGTCCAATGAGAACAACACTTTCGCTTACGTCGCCTGGCAACACGTCCGCACTTACTTTAACGTACGAAATCCAAGGATTCGCGGCTCGTACGCTCTGCTTTGCGAGGACGGCTCCCTCCGTCCTTTGACAGTTGAGGCCACTAACTCCAATGTCTCCCCATGTTCGTTCGTCAGACAGCCCTGGAGCGCTGTCGTGGCTACCACTACCAGAGCAAGCGACGTGAGTGCTGCTCTCAAGACCTGGTGGCCAACAGGAACCAACCCTGGAGGAGTTTCTTGGCAGTCGTCACTGTACAATAACTTGATCGGAGGCGCCCTGAGCATAATTTCTTTTGAAGACAGTCTCCCTGGACCAGGAAACTACAGCCAAAGTCGTAATTTCACCGCCATCGACGCCTCCTCATCCTGCATACCAGCTCGTCGCTGGTGCACCATTTCTACCGAGGAACACAATAAGTGCTCCTGGGTCCGCGCCTCTGCCTACACCCTGGGCATAGAACCCACTATCTCCTTCGTCTGTGCTTATTGA

Protein sequence:

>DPOGS214999-PA
MGWKKILLAVTLIAGVTAQNEFRVCVPPSFASQCQGLQSLGSPIICDTVESSTNPCPEGRPYNQRLITLSAYHTFKTNFENDKDRYRAIQRIALRSLCLVFVKMGWKKILLAVTLIAGVTAQNEFRVCVPPSFASQCQGLQSLGSPIICDTVESRLDCIMRLNRGDSDFGVFSEEEMVLMAHNQPNDNRVVASIKDILSNGSYAFEAVAVVPASHTGGLEGLRGMKYCHPGLDETEARWSPRVLKTLERAAARTDRCPDMDTNGKTAEEIEVQTLNSFFGAACRPGPWSANSSVDADLKSRFPNLCSLCGSNGDCSKYSIDMGPNIANVDNNNRHIQALECMRSNENNTFAYVAWQHVRTYFNVRNPRIRGSYALLCEDGSLRPLTVEATNSNVSPCSFVRQPWSAVVATTTRASDVSAALKTWWPTGTNPGGVSWQSSLYNNLIGGALSIISFEDSLPGPGNYSQSRNFTAIDASSSCIPARRWCTISTEEHNKCSWVRASAYTLGIEPTISFVCAY-