Login
Help

TRANSCRIPT CARD

Submit your Data

  1. Transcript 'Haaura.CG.MTP2014.S6...'
  2. Transcript 'Cisavi.CG.ENS81.R22....'
  3. Transcript 'Cisavi.CG.ENS81.R16....'
  4. Transcript 'KH2012:KH.L61.1.v6.A...'
  5. Transcript 'Boleac.CG.SB_v3.S13....'

Transcript Model

Transcript Id

Boleac.CG.SB_v3.S13.g02158.01.t

Possible name(s)

KMT2A; SETD1A; SETD1B

Location

S13 [329,440 / 340,407]

Sequences

Amino acid sequence

Length: 1,584

>Boleac.CG.SB_v3.S13.g02158.01.p
MGYAWARMXIAIRQHFIMSEREQNRNVNSNDANSPDYKLLIDPLIRKGEEKLIRFAGETY
NTENHGQGVVIIRDPRPRMNRLWSRRERCDLPVPRYKVDEWYIGAVPPREITFSKLNDNV
TKQFMEKMCSKFGKLERVRVYYHPKTQKHMGLSKVVFSSMKSAAECVKQLHNTSIMGNVI
HAQIDSKGELLNKLFQRIARGEMVAETVPDVIRHTQDGISMSNKFKERSHRRTSKDGSRR
SSQDDPRKQRKSVDESDRHPSRRQHRKDSTKEVNESNGNSASSGNREENATDTYELSNSE
SRKKLTSSDRDDSQSEIEHQPVDLESMPHDNSAPDHPVDRYNHGDRLRYEREGSREHSPH
YRHNHHERGSDDPKIHHYHRSEYDRNREPPPGSRYYDRHGYDRGFYHGNCESRENDYGPR
RSEHRRMYSPPRGPYRHSNHLPSRYRHPSPERFNHHRYSPPRNRTERYGQSHPGSPEPWR
RSPGYHPDGDARHMDRRRRSHHGRSPSPVLPNRESEAHRNTPRSXDSKISLDQRLKEMTG
ETSSRTKRKKKDQTKEQQLDNESSVDMEISEGESDHDNQGVRQQAVDYDNINMSHERWNT
GAPSWNGHEAQPIDARLWNLPQHDFHEGFDRNHQLSGQYQMNYPPETWPSHFQPAVMPPG
VMPPQFNYYPPNSYPMMPPMMNSETQPIDEDENLCQCVLSAIVEDLKSTVSRDLNRKVVE
TMAFKVYESWWDENANARSSSKTTSVGLIGDKLTTINEKPDASQNSQPPPEKPDNKLKSQ
SLLNAFDPLNWAKNSFEMDGFRIGLGLRSAVLKMPSFRVKKRPIIPSPVKDDEREDAMLS
KKARLANGKENQERSIVSSKKKRQRITVLDSEGEDNHSEDEDSYAESASDDDTKVKSSVL
NVDEVFSDSAESSDDLSSDESEKSSDEEDESSESESENEAIPQADDDSKLEHKESVSVPL
DEKSVDLPVEKKIEKPEKKDEPQAEAKPISYLTPESRASTESDTTNTVDDSNISSLLSPT
ALGVTPSAITPQQLRHVHPKKRHRLISHDQYEPFAKIPATDHDKLKDDLSLITEKERTVL
EALTDMHKTPERRMTTDSDASVASLPDKLVVPSRNQENLHDPLSASPQNELDAVQALLSF
RDAGSQKPVIPKKPTIQLRYKNIVTLSESALVAALAEEHNYFVPCNTRRRLPSSDPVVAQ
LLEEHNYSRAMWEEAVSDHERPPTPLSESTLSAESVYNFKDEVPEIITAAECVKLRREKL
KKLPKTTKKPKLSAAAIQFTSRNKMEDLVIAYEMWCSGLDLEDMKYLRKVYEDHLEDPSA
PTWVNDSHWVPHTVTATVSPKKSKSKRRRKDRNVDDNLGFHKTGSARTEGYYKLKDMVKL
QQRREMYGEGESQSTTSLKMAKSASNGMPEGGVSAPDKNLGARETRHFMRRIAAEVGSDA
GDLFKYNQLMFRKKAVKFKRSSIHGWGLFAEEPIAADEMVIEYVGQVVRTLIAERREVNY
TELGIGSSYLFRIDTEHIIDATKCGNYARCYAKVITVDTAKKIVIYSKGPIKKSQEITYD
YKFPLEDDKVMCYCGAENCRRTLN

Nucleotide sequence

Length: 4,762

>Boleac.CG.SB_v3.S13.g02158.01.t
AGCAGTTATGGGATATGCGTGGGCCAGAATGTNAATTGCCATTAGGCAGCATTTTATCAT
GTCTGAGAGGGAACAGAATCGCAATGTGAATTCCAACGATGCGAATTCACCGGATTATAA
ATTGTTGATTGATCCCCTAATTCGAAAGGGAGAAGAAAAACTAATTCGTTTTGCTGGAGA
AACTTATAACACAGAGAACCATGGACAAGGTGTTGTTATCATCCGTGACCCTCGGCCACG
GATGAACAGACTATGGTCCAGGCGGGAAAGATGTGACTTGCCAGTGCCACGATATAAGGT
GGATGAATGGTATATTGGGGCTGTGCCACCAAGAGAGATCACCTTCTCAAAACTAAACGA
CAACGTCACTAAGCAATTCATGGAAAAAATGTGCTCAAAGTTTGGCAAATTAGAGCGAGT
TAGAGTCTACTATCATCCTAAAACCCAAAAGCACATGGGATTGTCAAAGGTTGTGTTTTC
ATCCATGAAATCTGCTGCTGAGTGTGTGAAGCAGCTACATAACACCTCAATAATGGGCAA
TGTAATCCATGCACAAATTGATAGCAAAGGTGAGCTGCTAAACAAGTTGTTCCAGAGAAT
CGCTAGAGGTGAAATGGTTGCCGAGACTGTTCCTGATGTTATTAGACATACACAAGATGG
TATTTCTATGTCAAATAAATTTAAAGAGAGATCTCATCGACGTACTAGCAAGGATGGTAG
CAGAAGATCGTCACAGGATGATCCAAGAAAACAAAGAAAATCTGTTGATGAGAGTGATAG
ACATCCTTCTCGCAGACAACACAGAAAGGATTCAACCAAAGAGGTAAATGAATCCAATGG
CAACTCGGCATCATCTGGTAATAGAGAAGAGAATGCGACAGATACTTATGAACTGAGCAA
TTCTGAATCAAGGAAAAAGCTCACCAGCTCTGATAGAGATGACTCACAGTCAGAAATCGA
GCATCAGCCTGTCGATTTGGAATCAATGCCTCATGATAATTCTGCTCCTGATCATCCTGT
GGATAGATACAACCATGGTGATCGGTTGCGATATGAACGTGAAGGTAGTCGGGAACATTC
TCCGCATTATCGTCATAACCATCATGAGAGAGGATCTGATGATCCTAAAATTCATCATTA
TCACCGTTCTGAATATGATAGAAATCGAGAACCACCCCCAGGTAGTCGGTATTATGATAG
GCATGGTTACGATAGGGGGTTTTATCATGGTAACTGTGAATCACGTGAAAATGACTATGG
TCCAAGAAGGTCAGAGCATAGGAGGATGTACAGTCCTCCAAGAGGGCCCTATAGACATTC
AAATCATTTACCTTCACGATACCGGCACCCATCTCCTGAACGCTTCAATCATCATCGTTA
TTCTCCACCGCGTAACCGAACTGAAAGGTACGGACAATCTCATCCCGGAAGCCCTGAGCC
TTGGCGTAGATCCCCTGGATATCATCCAGACGGTGATGCTCGTCACATGGATAGACGTAG
ACGGAGTCATCATGGACGAAGTCCAAGTCCTGTACTGCCTAACCGGGAATCAGAAGCTCA
CCGGAATACACCACGAAGTATNGATTCTAAAATATCGCTGGATCAACGATTGAAGGAGAT
GACAGGAGAAACATCAAGTCGAACTAAAAGAAAGAAAAAAGACCAGACCAAAGAGCAACA
ACTAGATAATGAGTCATCAGTTGATATGGAGATATCAGAAGGAGAGAGCGATCATGATAA
CCAGGGTGTAAGGCAACAAGCTGTTGATTATGATAACATAAATATGAGTCATGAACGATG
GAATACAGGGGCTCCTTCATGGAATGGGCATGAAGCTCAACCAATAGATGCAAGATTGTG
GAACTTGCCACAGCATGATTTTCATGAAGGTTTTGACCGAAACCATCAACTCTCTGGTCA
ATATCAAATGAACTACCCCCCCGAAACTTGGCCAAGCCATTTTCAGCCGGCTGTTATGCC
ACCGGGAGTCATGCCACCTCAGTTTAATTATTACCCTCCCAATTCTTATCCCATGATGCC
CCCGATGATGAATAGCGAGACACAACCGATCGATGAGGACGAGAATTTATGTCAGTGTGT
ATTGTCGGCAATCGTGGAAGATTTAAAGTCAACAGTTAGTCGAGATCTGAATCGTAAGGT
TGTTGAAACTATGGCATTCAAGGTTTATGAATCGTGGTGGGATGAAAATGCGAATGCCAG
GTCTTCTAGTAAAACTACTTCTGTCGGATTGATAGGAGACAAACTGACAACAATTAATGA
AAAACCAGATGCTTCCCAGAATAGCCAACCACCCCCGGAGAAACCTGACAACAAACTCAA
AAGTCAATCACTGTTAAATGCTTTTGATCCTCTGAACTGGGCAAAGAACTCTTTTGAAAT
GGATGGTTTTAGGATAGGGCTTGGTTTAAGATCAGCGGTGTTGAAAATGCCATCTTTCCG
TGTCAAGAAGAGACCGATAATACCATCACCAGTGAAAGATGATGAAAGGGAAGATGCGAT
GTTGAGTAAGAAAGCAAGATTGGCGAATGGAAAAGAAAATCAAGAACGAAGCATTGTCTC
GTCAAAGAAGAAACGACAACGTATCACAGTTTTGGATAGTGAAGGAGAGGACAATCATAG
TGAAGATGAAGACAGTTATGCCGAGTCTGCCTCTGACGATGACACTAAAGTTAAGTCTAG
TGTTTTAAATGTTGATGAAGTGTTCAGTGACTCGGCAGAGAGTAGTGATGATCTGTCAAG
TGATGAGAGTGAAAAGTCTTCGGATGAGGAAGATGAAAGCTCAGAGTCTGAGTCGGAGAA
TGAAGCTATTCCACAAGCTGATGATGACTCAAAACTCGAACACAAGGAGTCAGTTAGTGT
TCCACTGGATGAAAAATCAGTCGATTTGCCTGTTGAGAAGAAGATCGAAAAACCGGAGAA
AAAAGACGAGCCTCAAGCTGAAGCCAAGCCCATTTCTTATCTTACACCAGAATCACGGGC
GAGCACAGAAAGCGATACAACAAACACTGTTGACGACTCGAATATTTCTTCACTGTTATC
ACCGACTGCTCTGGGTGTCACTCCTAGCGCAATAACTCCGCAACAGTTACGACATGTACA
TCCAAAGAAACGACACAGGCTGATATCACATGATCAGTATGAACCGTTTGCAAAGATTCC
GGCCACCGATCATGACAAACTGAAGGATGATCTTTCGCTGATCACTGAAAAGGAAAGAAC
TGTACTAGAAGCATTGACTGATATGCACAAAACTCCTGAGCGTAGGATGACAACAGACTC
TGATGCTTCTGTAGCATCACTTCCTGACAAACTGGTTGTGCCCTCAAGAAATCAAGAAAA
TCTTCATGATCCTTTATCTGCGAGTCCACAAAACGAACTTGATGCAGTCCAAGCGTTATT
ATCATTCAGAGATGCTGGTTCACAGAAGCCTGTTATCCCGAAGAAGCCTACAATTCAATT
GAGATACAAAAATATAGTGACACTGTCGGAGAGCGCCCTGGTTGCTGCTCTTGCTGAAGA
ACATAATTATTTCGTGCCATGTAACACAAGACGGCGGTTGCCAAGCTCTGATCCTGTTGT
GGCACAGCTTTTAGAAGAACATAACTACAGTCGAGCAATGTGGGAAGAAGCAGTATCGGA
TCATGAAAGACCACCGACACCACTGTCTGAATCAACGTTATCTGCAGAATCTGTGTATAA
TTTTAAAGACGAAGTTCCTGAAATTATCACTGCTGCTGAGTGTGTAAAATTGCGTCGAGA
GAAACTGAAGAAGTTGCCAAAGACTACGAAGAAGCCAAAGTTGTCAGCAGCTGCGATTCA
ATTTACGTCGCGCAACAAAATGGAAGATCTCGTCATTGCTTACGAAATGTGGTGCAGTGG
ATTGGATCTTGAAGATATGAAGTATTTAAGAAAAGTTTATGAGGACCATCTTGAAGATCC
TTCAGCGCCGACATGGGTCAACGATTCTCATTGGGTACCGCACACTGTCACTGCCACGGT
TTCGCCTAAAAAGAGCAAAAGTAAAAGACGCAGGAAAGATCGAAACGTTGACGATAATCT
GGGATTTCATAAAACTGGCTCTGCGAGGACAGAAGGTTATTATAAGCTTAAAGATATGGT
GAAATTGCAGCAGCGGCGAGAAATGTATGGTGAAGGTGAATCACAATCTACAACTTCATT
AAAAATGGCAAAGAGTGCATCAAACGGTATGCCTGAAGGTGGAGTTAGCGCACCTGACAA
AAATTTAGGAGCTCGCGAGACAAGGCATTTTATGAGACGCATCGCTGCTGAGGTCGGCTC
TGATGCAGGCGACCTGTTCAAATATAATCAACTAATGTTCCGCAAGAAAGCTGTCAAATT
TAAACGTAGTAGCATACACGGCTGGGGCCTGTTTGCTGAAGAACCTATTGCTGCTGATGA
GATGGTGATTGAATATGTTGGCCAGGTGGTGAGGACACTCATTGCTGAGCGACGGGAAGT
AAATTACACCGAACTAGGAATCGGTAGCTCTTACCTGTTCCGTATCGACACGGAACATAT
TATCGACGCTACCAAGTGCGGGAATTATGCACGCTGCTACGCCAAAGTTATAACCGTTGA
CACTGCTAAAAAAATCGTAATCTACTCGAAAGGACCTATCAAAAAATCTCAGGAGATCAC
ATACGATTATAAGTTTCCATTGGAGGATGACAAAGTCATGTGTTATTGTGGAGCTGAGAA
CTGCCGAAGAACACTGAACTGA

InterProScan

PANTHER
SETD1A (IPR037841) - T[36-242] 1.7E-161 - T[247-531] 1.7E-161 - T[691-1253] 1.7E-161 - T[1256-1584] 1.7E-161
SUPERFAMILY
RBD_domain_sf (IPR035979) - T[87-190] 1.78E-14
Gene3D
Nucleotide-bd_a/b_plait_sf (IPR012677) - T[103-210] 9.9E-21
ProSiteProfiles
RRM_dom (IPR000504) - T[109-187] 10.445
SMART
RRM_dom (IPR000504) - T[110-183] 9.6E-9
Pfam
RRM_dom (IPR000504) - T[116-181] 9.5E-9
SMART
COMPASS_Set1_N-SET (IPR024657) - T[1283-1444] 3.8E-25
Pfam
COMPASS_Set1_N-SET (IPR024657) - T[1293-1448] 7.7E-18
ProSiteProfiles
SET_dom (IPR001214) - T[1454-1562] 11.35
SMART
SET_dom (IPR001214) - T[1454-1568] 1.9E-20
Pfam
SET_dom (IPR001214) - T[1465-1561] 2.7E-6
SMART
Post-SET_dom (IPR003616) - T[1568-1584] 3.9E-4
ProSiteProfiles
Post-SET_dom (IPR003616) - T[1568-1584] 9.74

Best Blast Hits in UniProt
Protein Name Identity Bit Score e-value
KMT2A_HUMAN 48.252 % 139 9.59E-33
SET1A_HUMAN 38.605 % 166 3.56E-41
SET1B_HUMAN 45.679 % 256 9.66E-69