Monarch geneset OGS2.0

DPOGS201053
TranscriptDPOGS201053-TA1170 bp
ProteinDPOGS201053-PA389 aa
Genomic positionDPSCF300289 - 219137-232766
RNAseq coverage17x (Rank: top 80%)
Annotation
HeliconiusHMEL0048911e-9254.12% 
BombyxBGIBMGA007993-TA1e-12376.60% 
DrosophilaCG31646-PA2e-7446.26% 
EBI UniRef50UniRef50_UPI00022C8FF41e-7548.08%UPI00022C8FF4 related cluster n=3 Tax=unknown RepID=UPI00022C8FF4
NCBI RefSeqXP_002133117.11e-7346.26%GA28999 [Drosophila pseudoobscura pseudoobscura]
NCBI nr blastpgi|3504018654e-7548.08%PREDICTED: lachesin-like [Bombus impatiens]
NCBI nr blastxgi|3454815446e-7946.13%PREDICTED: neurotrimin-like [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[235-308] IPR0137831.7e-21Immunoglobulin-like fold
[228-308] IPR0130981.2e-12Immunoglobulin I-set
[227-299] IPR0035984.9e-10Immunoglobulin subtype 2
[127-212] IPR0035993.3e-06Immunoglobulin subtype
Orthology groupMCL25414 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201053-TA
ATGGAATCTTCGGTTTTGTTAAGAGTTCATCTCTTCTATTCCCTTGATCTTCCACGTTTTGTTGGACCAGGGTCTAATGTCACTGTGGCCGTCGGCAGAGATGCTGCACTTACTTGCAGAGTTGATAACTTACAATCGTTTAAGGTGGCATGGTTACGAGTGGATACGCAGACGATTCTAACGATAGCTGGTCACGTAATTACTAAGAATCACCGCATAAGTGTCCAGCATGGAGACGGAGCCTGGACCTTGGGGTTGAGAGACGTCAGCCCGACGGACGGAGGGCGGTATATGTGTCAAGTGAACACTGAGCCTATGATGAGTCAGACTCATTTACTACAGGTTGTGGTACCGCCTGATATTGACGATGACGTCAGTAGCAGCGAGGTTATAGTCAAAGAAGCAGATAACGCGGCCCTGCGATGTGTTGCCTCTGGAGTTCCCCCTCCAACAGTGACGTGGAGAAGGGAAGATTCTAGACATTTCAAAATTGATAACCACACTTTAATATCAAAACACAGCGGTGAATGGCTAAATTTGACTGGAGTTGAACGAGTTACATCCGGCTCGTATCTCTGTATAGCCACAAACGGTATTCCTCCGTCAGTGAGCAAGAGGATACAAATAAACGTCATGTTCGCTCCTTCCGTGTGGGCTGGCCGGGTGGCTATACGAGCGTTGGCTCACAGTGCGGCGACGCTCTCTTGCACTTCGGAAGCATTTCCTACACCAAATGTATACTGGATGCTTAATGGAGAACAGAGGCTTGTTAACGGTTCAAAGTATAAAATAAGCAAAATAAGCCGAGGCTACCGTCATACCCTGACGCTGCAAGTGAGCGAGATGACAAGAGATGATGCTGGAGCTTACCGCTGTCATGTTGAAAACAATATGGGGAAAGCACAAGCCGAGATGTTTCTACATTTATTGACCACAACCACAACGACTACCACCACTACGACGCCGCCTCCGACCACGACCACACCACCTCCGACCACCACTAGTATGTCAACATATGATGTCGGTCAGTTGGAACAGCTTCATCGAGGGGTGATGGAAGACCCTCGCGAAGACACGTACGTCGTCCTCAGTCAACACCAGATGCAGAACCTTGGTGAATATAACACTCCTCACGGCGTGTACTGTGTTGTATTTATTTATTGCAGATGA

Protein sequence:

>DPOGS201053-PA
MESSVLLRVHLFYSLDLPRFVGPGSNVTVAVGRDAALTCRVDNLQSFKVAWLRVDTQTILTIAGHVITKNHRISVQHGDGAWTLGLRDVSPTDGGRYMCQVNTEPMMSQTHLLQVVVPPDIDDDVSSSEVIVKEADNAALRCVASGVPPPTVTWRREDSRHFKIDNHTLISKHSGEWLNLTGVERVTSGSYLCIATNGIPPSVSKRIQINVMFAPSVWAGRVAIRALAHSAATLSCTSEAFPTPNVYWMLNGEQRLVNGSKYKISKISRGYRHTLTLQVSEMTRDDAGAYRCHVENNMGKAQAEMFLHLLTTTTTTTTTTTPPPTTTTPPPTTTSMSTYDVGQLEQLHRGVMEDPREDTYVVLSQHQMQNLGEYNTPHGVYCVVFIYCR-