Monarch geneset OGS2.0

DPOGS213048
Transcript: DPOGS213048-TA, 1380 bp
Protein: DPOGS213048-PA, 459 aa
Genomic position: DPSCF300331 - 235497-257356
RNAseq coverage: 29x (Rank: top 76%)
Annotation
Heliconius: HMEL008015, 1e-165, 63.10%
Bombyx: BGIBMGA008935-TA, 4e-09, 67.27%
Drosophila: (no hit listed)
EBI UniRef50: UniRef50_Q8C119, 6e-13, 23.77%, Protein NDNF n=22 Tax=Euteleostomi RepID=NDNF_MOUSE
NCBI RefSeq: XP_971168.1, 2e-12, 23.81%, PREDICTED: similar to Fibronectin type-III domain-containing protein C4orf31 homolog [Tribolium castaneum]
NCBI nr blastp: gi|224049197, 6e-15, 24.20%, PREDICTED: hypothetical protein [Taeniopygia guttata]
NCBI nr blastx: gi|47209515, 2e-14, 23.60%, unnamed protein product [Tetraodon nigroviridis]
Group
KEGG pathway: (none listed)
InterPro domain: [161-456] IPR019326, 9.3e-20, Protein of unknown function DUF2369
Orthology group: MCL21970, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213048-TA
ATGAATAGTTATATAAGTCCGGACATCGTCACGACAAGGCTAGCAGCGCTGATCATACTTCACGCAGCTAGAGCTGGATTCGCAGCTCCAAGTGTTCTCGTCACCAGACCAGTCAGCAGGCTTTCCGAAGAATGGCTGCCAATGGATAACCAGACCTTGTACACATTGGAAGAAGGAGAAAGTCGAAATCCAATAGACCCTCAATCTACAACCTATTGCGTGGTCGCATCTCGGAGAAAGAACTACACTTCACTTTGCGCAGCTCAATACGACCTCCGGAATACAAAAACAGAAAAAGATACATCCTCTGATCAGTCGCTGGAAAACCATAAACCTAACCGTTCACAAAACAGCGGTTCGGAAGAGAATAAAATTGATATAGATAAAGACAGCATAAGTATTTTTGATGGAAATTATAGGACTTTGTATAGAAGAAAAAAGTTTGGTCGTAGCACAAGAGTATCAAACGAAGACCCACTGGTTGTTTGTATAGGAGACAGAACACATCACTTTATTGAAAATTTAGATCCTGGCACAACCTATTTCGTCAGTATTTTCGGCATTGCAAGAGATAGACAAATTGGGTCCTTATTAGCATCTGGAAGCGTTCGACCCAGGACGTCCACTGCTAAAAGACTAAGGGAAAATGTTCCTTACAAGGCTGACATAAAGGGAAAAAATGTTTATTACTTGAAGACAACGACCAGTAGTACTTCAACAAACGCTGGTCTTTGGATTGCGACGTCCACTTGCGGAGGTTCAGTTGACATCGAAGTTTATGTAAAAGGAAAGCGATTGTACGTGGCCAAAAATATAGAAAATCATTCAAAGTTTTTCGTGCCATCACCAATACTTTCATCGACACAAGAAACAAGTGACGAGGGTTCTGTGCAGTTCGATTCGAGCTCAGAGGAATCAAAAATAAGATACATTGTACGAGTTGTGCCTAACAAATGGGCTATTGATGGTGCCGTAACAATAGAGTTACTGGCCTCAACATCAAGATGGGGCATGGATACGCCAGAACTTACCGACGGCGGAGTGATAAGGGAACTCCGGCCGAGGAGGTCTTGTAAATCCGTAGACATTGCCTTTTTACCAGCCACCCATAACGCGACTGATGTGATTAGATATTGCATTTCAACAAAAGAAATAGTCGACAAGGACATAACAATCTGTGCGTTAACAAAAAAATCATCAACAAAAACACAATGCATAAGTCACATGCAACGTCCTCAGTCAAGAGTAATAGTCCAGAAAGTGAGTGGCTTAAAACCGGGAAGAAAATACGGAATTCAAGTAACTGCCGCCTCCAAAGGGATCTCAGTGCCATACAATGTCCTGTATGTGGAAACTAATGCAACTTGCAAAGAAGAATAG

Protein sequence:

>DPOGS213048-PA
MNSYISPDIVTTRLAALIILHAARAGFAAPSVLVTRPVSRLSEEWLPMDNQTLYTLEEGESRNPIDPQSTTYCVVASRRKNYTSLCAAQYDLRNTKTEKDTSSDQSLENHKPNRSQNSGSEENKIDIDKDSISIFDGNYRTLYRRKKFGRSTRVSNEDPLVVCIGDRTHHFIENLDPGTTYFVSIFGIARDRQIGSLLASGSVRPRTSTAKRLRENVPYKADIKGKNVYYLKTTTSSTSTNAGLWIATSTCGGSVDIEVYVKGKRLYVAKNIENHSKFFVPSPILSSTQETSDEGSVQFDSSSEESKIRYIVRVVPNKWAIDGAVTIELLASTSRWGMDTPELTDGGVIRELRPRRSCKSVDIAFLPATHNATDVIRYCISTKEIVDKDITICALTKKSSTKTQCISHMQRPQSRVIVQKVSGLKPGRKYGIQVTAASKGISVPYNVLYVETNATCKEE-
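
The transcript and protein lengths listed above are consistent if the 1380 bp sequence is read entirely as coding sequence ending in a stop codon: 1380 / 3 = 460 codons, one of which is the stop, leaving 459 amino acids. The following is a minimal Python sketch of that check, assuming the standard genetic code. The function names `translate` and `expected_protein_length` are hypothetical helpers for illustration and are not part of any database tooling. The sketch writes the stop codon as `*`, whereas the protein record above ends in `-`.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(nucleotides: str) -> str:
    """Translate a DNA string in frame 0; stop codons become '*'.

    Trailing bases that do not fill a complete codon are ignored.
    """
    seq = nucleotides.upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def expected_protein_length(cds_bp: int) -> int:
    """Protein length for a CDS of cds_bp bases that includes its stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# First 24 and last 9 bases of DPOGS213048-TA, copied from the record above.
print(translate("ATGAATAGTTATATAAGTCCGGAC"))  # start of the protein
print(translate("GAAGAATAG"))                 # last two residues plus stop
print(expected_protein_length(1380))
```

Running `translate` on the full nucleotide sequence and comparing the result with the listed protein (with the trailing `-` replaced by `*`) would extend the same check to every residue.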