Monarch geneset OGS2.0

DPOGS214722
TranscriptDPOGS214722-TA2448 bp
ProteinDPOGS214722-PA815 aa
Genomic positionDPSCF300022 - 99149-112418
RNAseq coverage433x (Rank: top 28%)
Annotation
HeliconiusHMEL0085930.071.79% 
BombyxBGIBMGA005066-TA0.067.03% 
DrosophilaKrT95D-PB1e-7337.19% 
EBI UniRef50UniRef50_A0NG638e-13238.76%AGAP003312-PA n=2 Tax=Anopheles gambiae RepID=A0NG63_ANOGA
NCBI RefSeqXP_001657096.13e-13039.74%hypothetical protein AaeL_AAEL003623 [Aedes aegypti]
NCBI nr blastpgi|3479696521e-13240.00%AGAP003312-PB [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479696526e-12939.74%AGAP003312-PB [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[432-808] IPR0193813.2e-84Phosphofurin acidic cluster sorting protein 1
Orthology groupMCL11276 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214722-TA
ATGACGGAATTATGTCAGCCTGGAAATAGGCTGTGCACGCTTCGTGTGACTCGTCTCCGCGTGTGTGGTTCCCTGGGGGACTCTGGGGGGAGCGGCGCGGGCAGTGCGGGCGCCGGGGCTGTCACCCTCGCAGCCAGGATGCACAGCAGCAAGCGCACCCTCCGTTCCAACGACATCACCGTACCCCCTATGGATGTGCTGCAACGTTCGATGGATATGGAACTAGAGTTGACTAGCGTGGGCGGGAAGGTGGGCGCCGGTGGGCAACCCGTGGCCAGGCTCACTATCACCGGACTCGCCTCCACACCGGTCGATCACGACACCAAGAACAACAACACGCTGCTCATCACCGAGCGCGGATACTCCGACGAGGAGGAGGAGGGCGAGTTCAGCTCGGTGGACGAAGCTGACGACATGACGTACGGCGGAGCCGCGGGCAGGAGGGCGCACAGGCAGCTAGCCTTCAAACAGGGAGACCTCTCATATACTAAGCTATCACGCTCGAGAGAGCTAAGCTTCAGGGAGCGAGAGAGAAAGATACAGAGCTATATAAGGTCGAAGGAGCAAGAGAGAGACAGCGATAGTGAGATGGAGACCACCAGCAAGAGAAAGGCCAGTAGCGGCAAGCTCGCGCAAAGGAATCTGAAGCAGAAGTTCGCGGCGCTGCTGAGGAGGTTCCGCGTGCCCGAGGAGCTGGCGGACAGAGGAGCGGCCAGGGACACGCACCACGCTCAGAGAGACATAGACGAGCTGTTCCAGGAGTTGGAGTCCCTGTCGTGCGGCGAGGGCGAGGACTCCGGACCGGACCAGATGGACACCATCAGTATCGGCTCAACGCCCAAGCCCTCGCTGCGGCCGTTCTTCAGCAGCTCCAGGAGCCTCGCCAACCAGGAACACCACAGACATTCCAACGTGACGCGACACGCGAGCCTGGCCTCCGCCGGGGAGCTGCGGCCGCGCGAGAGACTCGCCTCCGCACCGCTCACAGAGTCAGAGGCTGTACGCAGTTCGGCTGGCGATGAACGAGCCAGCGAGGGGAACAGCGACGGTGACGTGACGCTGGACGCGCCCGTCTCTGGCGCATCCGTATCCAGCTCGCCGCCCAATGAGACTAAGTCGTCGGAGGACAAGCGGTCCCGTTTGTTCCGCAGCGCCACCAGCGGCGGCAACCTGACCCCGGCCGTCTCCTCGCGCAAGAAGAACTCCCTCGTCATCCACACGGAGCGGCCGCTCAGCGCACACGACCTGCCCTCACACAGTCCCACCACCGTGGAGCCCCGTCGCACTCTACTGGAGCAAGTGTCTCGCGTGCTGGGTGAGGAGGGTCCCGGGCCGGAGTGTGTGGCGGTGGCGCCCCCGGGCCTGGTACGGGCTCTGCACGCGCTCGGCGTCCCCACGGTGCCTTCGCCCCCGGCGGCCGTGCCCGACGCTCGACCACTCCTCCAGGCCCTGCTCGCCCGGGCCGCTAAACAAGGGGTCCGGCGCGTGGTCCGCGTGGTGGTGTGCGGAGGCGAGGCGGCGGCGGCCGCGGCTCTCAGGGCGCACGCTGAGCTCGCCGCCCGGAGACACGACGCTCCGCTCCTCAGATACTACATCATACCCACAGGTGTGAACACGGTGGCGCGCTGGTTGGGCGCCATGGACAGCACGTACGCGGGGTTGTTCTGCAACGAGTCCTGGAGCGCGCTCTGTGAACGGGCACAGGACGCCTGTATGGCCGACGTGGCCGAGATGAGCTCCAGGATAGCCAGGTACATCAATAGCATCGGCCCCGTCAACAACATACCCATAGGAGAGGCGATGGTCGCCTACCGGGAGCGGTCCGGGGACGAGGACTCCAGCCAGACCTTCGTGCCCTTCATAGCGGAGGTGCGAGTCGGTTGCGGCGAGGGTGGTCCCTCGTCCCTGGAGCTAGACGAAGGCGGCTCGCCCCCGGCCCGGCCTTCGCCCCCCGCCACGCCCGCTCCGGCCTCCGCCCCGGACCTCGCCCCGCCGCTCGCGTTCGCACGGACCGAGCCCCTGGAACTACAGCTGGACTACTGGCTGGTTGGCGCTCGCTGCTCAGAGGGCAGCGCGGCGGCGGGGGCGGGCGCGGAGGGCGGCAAGGTGACGCTCAAGGCGACCTTCAGAGCTCTGCTCGTCACCAGGCACAATCACCACCTCTGCATCACATACCTCACCAAGGAAAAGAAACAAAAAATCATGCGCCTGGGAAAGAAAAAGGAGAAACCCGGCGAGGCTGAAGGGGGGCGCGCTCACACCGTGGAGGGAGTCGCCAGGCTGATCTGCTCCGCCAAGGGCTCGCACAACGCACCGCTTAAAGTGTACATAGACGGCACGGAGTGGAACGGCGTGAAGTTCTTCCAGCTGTCGACCCAGTGGCAGACGCACGTCAAGACCTTCCCCGTGGCCACGTGCGGGGCTCCGCTGGCGCCCTCGGACTCCTAG

Protein sequence:

>DPOGS214722-PA
MTELCQPGNRLCTLRVTRLRVCGSLGDSGGSGAGSAGAGAVTLAARMHSSKRTLRSNDITVPPMDVLQRSMDMELELTSVGGKVGAGGQPVARLTITGLASTPVDHDTKNNNTLLITERGYSDEEEEGEFSSVDEADDMTYGGAAGRRAHRQLAFKQGDLSYTKLSRSRELSFRERERKIQSYIRSKEQERDSDSEMETTSKRKASSGKLAQRNLKQKFAALLRRFRVPEELADRGAARDTHHAQRDIDELFQELESLSCGEGEDSGPDQMDTISIGSTPKPSLRPFFSSSRSLANQEHHRHSNVTRHASLASAGELRPRERLASAPLTESEAVRSSAGDERASEGNSDGDVTLDAPVSGASVSSSPPNETKSSEDKRSRLFRSATSGGNLTPAVSSRKKNSLVIHTERPLSAHDLPSHSPTTVEPRRTLLEQVSRVLGEEGPGPECVAVAPPGLVRALHALGVPTVPSPPAAVPDARPLLQALLARAAKQGVRRVVRVVVCGGEAAAAAALRAHAELAARRHDAPLLRYYIIPTGVNTVARWLGAMDSTYAGLFCNESWSALCERAQDACMADVAEMSSRIARYINSIGPVNNIPIGEAMVAYRERSGDEDSSQTFVPFIAEVRVGCGEGGPSSLELDEGGSPPARPSPPATPAPASAPDLAPPLAFARTEPLELQLDYWLVGARCSEGSAAAGAGAEGGKVTLKATFRALLVTRHNHHLCITYLTKEKKQKIMRLGKKKEKPGEAEGGRAHTVEGVARLICSAKGSHNAPLKVYIDGTEWNGVKFFQLSTQWQTHVKTFPVATCGAPLAPSDS-