Monarch geneset OGS2.0

DPOGS203338
TranscriptDPOGS203338-TA1308 bp
ProteinDPOGS203338-PA435 aa
Genomic positionDPSCF300003 - 212802-217819
RNAseq coverage266x (Rank: top 40%)
Annotation
HeliconiusHMEL0179348e-16568.56% 
BombyxBGIBMGA003860-TA1e-8670.95% 
DrosophilaCG9104-PA1e-9242.32% 
EBI UniRef50UniRef50_Q9VXA02e-9042.32%CG9104 n=27 Tax=Coelomata RepID=Q9VXA0_DROME
NCBI RefSeqXP_001664256.11e-10744.93%hypothetical protein AaeL_AAEL003870 [Aedes aegypti]
NCBI nr blastpgi|1571385672e-10644.93%hypothetical protein AaeL_AAEL003870 [Aedes aegypti]
NCBI nr blastxgi|3123774795e-9142.35%hypothetical protein AND_11201 [Anopheles darlingi]
Group
KEGG pathway 
InterPro domain[19-190] IPR0093483.8e-55Nitrogen permease regulator 2
Orthology groupMCL10638 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203338-TA
ATGTCCTCAAAAATTATGGAAACACGTTATTACGAGGGATGTGGACGAGAGGGTCCCATCCGCTGCATCTTCCTGGGGGAGTTCCATCCGGTAGCAGGACCCAAAATATCGTGCCAGTTTCCTGAAGACTATGTGTCTAAGGAGCTGTTTGATTCAATAAGTGCTTACATCATTCCAAAACCACAAATACAGAAATGCACAATGACAATTAACGCCCTCGGTCACAAAATCATCGGCTATCCAATACGAATAGAAAACTCGAGGTATGAACGGAACATGTATCTGTTCAATCTCTGCTTTGTCTGCGACAGTTGGTCGAAGACCGTTCAGTATGAACCCGTTGTGAAGAAGCTCGGAGAGCACTTGACGATCATGGAAGAGGAGACCCGTTTTGTTTCAAGTGGCTCCAGCAAACTGCCGACCCTGCTTGCACACTTACTCCACGATCTGAACACTTATAGGAAAGCGACGCTCGTTGAGGGCGACACGGTTATGCACCTCAAAGTACTGGAAGTGAGAAAAGATCCAACCCCCGTCCATGATTTTGATGTTCCTGTACTGGTGGCTTCGGTAGGGCTTCCGCGACCGGGGCGCCCGCCGCGTGTGCCCCCCGGACCACAAGAGGAGTCCGAGCAGAGCCAGGAACACATCGAAGTTGAAGAAGAACCCTTTGAGGTGCACCTAGACGCCGACTGGGATCTCACAACAAGACAGCTTCTTCCTCACATAAACGGTTACAACCACATCTCGAAGATAGCTTCGGACACCAACGTTGAAAAGACACTCGTCAAGTCGTGTATACAAAACTTGGTGTACTACGGAGTGGTGACTCTGATACCCGTGCTCAAGTTCAGCAACATGTACCGAGCCACACCTAACCTGAGCCGATTAATGAACGATCATGACATGCAGCACTCGTGTCTGAAGTACATCAACAACGACTGTGAGGGGAAGGACAAGCCGTCACTGTCCGACGTGGTGGGTGTGCTGTGTTCACTGCAGCAGGGTACAACGTTGCGTGCGGTGTGCGATCGGCACTTCACATCCCCCGGGGTGCCGTTCGATGTGAGACGGCTGATAGTGTTCGCACAAATACACGGCCTCGTAAAGTGTCTTAAGAGGTATCCAGTCTATATCCGGAACCCGACACGTCAGAACGGCTACAGAGTCGACTCTATAATAGGTATACGAAGGCTGTTCACTGGCAGGCACAACGTGGACGAGATATGCTGCCTAGCCCGCATTGACCTCCCTACCCTCGATCAGATTATAGAAGACGATCCCAACGTTATTATAATATGGAGATAG

Protein sequence:

>DPOGS203338-PA
MSSKIMETRYYEGCGREGPIRCIFLGEFHPVAGPKISCQFPEDYVSKELFDSISAYIIPKPQIQKCTMTINALGHKIIGYPIRIENSRYERNMYLFNLCFVCDSWSKTVQYEPVVKKLGEHLTIMEEETRFVSSGSSKLPTLLAHLLHDLNTYRKATLVEGDTVMHLKVLEVRKDPTPVHDFDVPVLVASVGLPRPGRPPRVPPGPQEESEQSQEHIEVEEEPFEVHLDADWDLTTRQLLPHINGYNHISKIASDTNVEKTLVKSCIQNLVYYGVVTLIPVLKFSNMYRATPNLSRLMNDHDMQHSCLKYINNDCEGKDKPSLSDVVGVLCSLQQGTTLRAVCDRHFTSPGVPFDVRRLIVFAQIHGLVKCLKRYPVYIRNPTRQNGYRVDSIIGIRRLFTGRHNVDEICCLARIDLPTLDQIIEDDPNVIIIWR-