Monarch geneset OGS2.0

DPOGS214254
TranscriptDPOGS214254-TA1794 bp
ProteinDPOGS214254-PA597 aa
Genomic positionDPSCF300014 + 1440267-1443749
RNAseq coverage52x (Rank: top 70%)
Annotation
HeliconiusHMEL0113707e-7468.56% 
BombyxBGIBMGA005970-TA2e-5654.17% 
DrosophilaCG18635-PA4e-4623.78% 
EBI UniRef50UniRef50_B0WMX67e-4726.06%Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WMX6_CULQU
NCBI RefSeqXP_001658668.12e-5229.06%hypothetical protein AaeL_AAEL007792 [Aedes aegypti]
NCBI nr blastpgi|1571169564e-5129.06%hypothetical protein AaeL_AAEL007792 [Aedes aegypti]
NCBI nr blastxgi|1571169561e-5629.68%hypothetical protein AaeL_AAEL007792 [Aedes aegypti]
Group
KEGG pathway 
InterPro domain[33-226] IPR0186295.5e-09Transport protein XK
Orthology groupMCL15023 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214254-TA
ATGGCGGTGTGTGACTTAGACAATGATATATCTGGGGAAGTGCTAGGATGGAAGTTACCGGATTGGCAGGCATTGCTGTTTCATCGTGTGCTACCATCTTGTGGCGGACTAGCAATTTATTTGACTGTTATTTGTTACGATTTAGCACTTATTTACGAACATATAAATAATGGTTATAAAGCACTTGGCATATTGTATTTAATACTAATGATATTGCCAGCTCTGCTGTCGTTAATATTTACGTTAGCATCACCGCCACCTGGCCTACAGACCGAGGAATCAGCTTTTACAGTTACAATTAACAATGATGACGTCAGGTGGATTTTCGAAGAAATTTTGAAGACTGTACTATTTCCTATATCTGTTTGTGGCAGATACTGTTTTTTAATATTTTGGTGGATAGAAGCGGTGTACGCCGCCCGCGAAAACGATGAAGATAGAACAAAGGAAGCCCTGTCCAAAGCTCGTAAACCATCATCGGTCGAGCTGTACTTATTCCTGCAAGCTTTCATACATTCGGCACCTTACGTCGTTATAAACATTTTGGATATGATGGCGAACTATGCGGACCCGAAATATGATAGAATGACTCTGCAAGCAGTGAGTATGATTGCTTCATCACTTCGTATGGCGAGTACTGCAACGATATACAGAAGATTTGAAAGGGAAAAACTGTGTGGACGTAAATATCCATGGAATGCTAAGACAGAAAGTGAAAATTTAAATAAAACTGTTGATTCCGAAAAGGAAAACAGAACAAATGAAACTGTCACAGTCGAACCACTCTTTGGTGCTGTATTAGAACGACATTCATCATCAATTTCTCGTAAAACAATTGACGACAGGGATGAAATTATAAGTGATTTGATACATTTTTCACCAAGGAACTCAGAAAAGAATTCCGTATTATACGAAGATAATATCGATTACAATTCGCATTCCGATACAAGTTCTGACTACTCGCCTCCTGCGACGCGTAACGAGATACTGGACAGTGATGATGAATACGTTAAGCCGATATCGATTATAGATAAAATAGCACCGCGACGATCCGCTGAATACACGGTAGAGCATGTGTACGTCCCTCCGCCGCCAGCGTTCATGGCCCCGCGGCCAGGGTCCTTCGCTGTGTGGGCGGAGAAATTAGTTGAAAACGCGGAATCAATCCCCACGTGGCTGTCTGCCCCGCCAAGAAGAAAACACTGGGAAGTCATCCCGGACGAACCTGACGTCCCACGGCGAGTACCGCGTAACTACATGCGAGGTCTCCAGCCTCAGGATGCCACAGCAGCTTTAGTAAATTTTTTAGGGTGGTATGCTTTTTTTGTAGCGCGTTTGCTCTCTATCGCAGCTTTTATAGATTTTTTTCCGTTCGTAGCGATTATAATTTTAATGTGCCACTATCAAGCCATGTTGCTATTTTTAATAGTCCCACAGGCGAGTACAGTGAAGAGGGCTTTCTATGTGTTCCTCGCCTTCATATACTTATTTTGCCTTATGGAGTTTAAGATCCGTTTCCGCCACGTTCGCGTGTGGCATGTGTTTTGGATCATAGTGTGCACTGTTGAGATCGTTGTATTCATATCTTTATGGGCGACCATTGATAATAATCTCCACGATTGGTGGAAACATTTCGTAGTAACTGTGACCCTCGTCAGTATGTGTATTAGTTATGTACTGTTTCTTTCATATTTCGTCCTCTTGCAGCCCAGAGAGACCGTTGTATATGTCGACGAGAAAAACTGTTACAAATATAGGAAAGATTTAGAGAAAATTAATAAATTAGATATTTAA

Protein sequence:

>DPOGS214254-PA
MAVCDLDNDISGEVLGWKLPDWQALLFHRVLPSCGGLAIYLTVICYDLALIYEHINNGYKALGILYLILMILPALLSLIFTLASPPPGLQTEESAFTVTINNDDVRWIFEEILKTVLFPISVCGRYCFLIFWWIEAVYAARENDEDRTKEALSKARKPSSVELYLFLQAFIHSAPYVVINILDMMANYADPKYDRMTLQAVSMIASSLRMASTATIYRRFEREKLCGRKYPWNAKTESENLNKTVDSEKENRTNETVTVEPLFGAVLERHSSSISRKTIDDRDEIISDLIHFSPRNSEKNSVLYEDNIDYNSHSDTSSDYSPPATRNEILDSDDEYVKPISIIDKIAPRRSAEYTVEHVYVPPPPAFMAPRPGSFAVWAEKLVENAESIPTWLSAPPRRKHWEVIPDEPDVPRRVPRNYMRGLQPQDATAALVNFLGWYAFFVARLLSIAAFIDFFPFVAIIILMCHYQAMLLFLIVPQASTVKRAFYVFLAFIYLFCLMEFKIRFRHVRVWHVFWIIVCTVEIVVFISLWATIDNNLHDWWKHFVVTVTLVSMCISYVLFLSYFVLLQPRETVVYVDEKNCYKYRKDLEKINKLDI-