BLASTP 2.7.1+ Reference: Stephen F. Altschul, Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for composition-based statistics: Alejandro A. Schaffer, L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Database: GCF_000001405.40_GRCh38.p14_protein.faa 136,193 sequences; 94,334,868 total letters Query= KBF55062.1 Length=409 Score E Sequences producing significant alignments: (Bits) Value XP_047272971.1 collagen alpha-1(XXIV) chain isoform X5 [Homo sa... 42.7 0.003 XP_016856416.1 collagen alpha-1(XXIV) chain isoform X4 [Homo sa... 42.7 0.003 XP_016856417.1 collagen alpha-1(XXIV) chain isoform X8 [Homo sa... 39.3 0.031 NP_001336884.1 collagen alpha-1(XXIV) chain isoform 2 [Homo sap... 38.9 0.040 > XP_047272971.1 collagen alpha-1(XXIV) chain isoform X5 [Homo sapiens] Length=1683 Score = 42.7 bits (99), Expect = 0.003, Method: Compositional matrix adjust. Identities = 139/408 (34%), Positives = 154/408 (38%), Gaps = 85/408 (21%) Query 4 NGGAGGSGAPGAIGGAG--------GPAGLIGVGGAGGAGGDSAVAGVIGGAGGAGGAAL 55 +G G G PG IG G GP G++G+ G G G G IG G G Sbjct 1058 DGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSR-- 1115 Query 56 LFGAGGAGGAGGSGGSGAAG-----------------------GAGGAGGAGGLFASGGS 92 G G G SG GA G G G G GL G Sbjct 1116 ----GPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGE 1171 Query 93 GGFGGFAST--------------GTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGG 138 G+ G ST G G G GA G GV G G G G G G Sbjct 1172 KGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPG 1231 Query 139 TGGAGGLFASGGAGGAGGAATTGTGGAGGAGGKAGL-----LFGSGGAGGAGGSSGIGGF 193 G GL G G G G GA G GK G+ L G G G G+ GI G Sbjct 1232 DQGEQGL--KGERGSEGNK---GKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISG- 1285 Query 194 AAGGAGGPGGAGGLFNGGGAGGAGGSGVSGGAGGEGGAGGSGGGGSVAGDGGAGGNAGLL 253 G GP G GL G GG G +G++G G G G SG GS G G G GL Sbjct 1286 -NPGKIGPPGKQGL--PGIRGGPGRTGLAGAPGPPGVKGSSGLPGS-PGIQGPKGEQGLP 1341 Query 254 -APGLAGGAGGGGGQGFDTGGAGGPGGDAGL--LVGSGGVGGAGGFGLTTGGPGAAG--G 308 PG+ G G G QG GP GD GL G GV G GF G PG G G Sbjct 1342 GQPGIQGKRGHRGAQG-----DQGPCGDPGLKGQPGEYGVQGLTGFQ---GFPGPKGPEG 1393 Query 309 DAGLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGG 356 DAG++ SG G G G T G G+ G+IG G G G G Sbjct 1394 DAGIVGISGPKGPIGHRGNT------GPLGREGIIGPTGRTGPRGEKG 1435 > XP_016856416.1 collagen alpha-1(XXIV) chain isoform X4 [Homo sapiens] Length=1711 Score = 42.7 bits (99), Expect = 0.003, Method: Compositional matrix adjust. Identities = 139/408 (34%), Positives = 154/408 (38%), Gaps = 85/408 (21%) Query 4 NGGAGGSGAPGAIGGAG--------GPAGLIGVGGAGGAGGDSAVAGVIGGAGGAGGAAL 55 +G G G PG IG G GP G++G+ G G G G IG G G Sbjct 1073 DGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSR-- 1130 Query 56 LFGAGGAGGAGGSGGSGAAG-----------------------GAGGAGGAGGLFASGGS 92 G G G SG GA G G G G GL G Sbjct 1131 ----GPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGE 1186 Query 93 GGFGGFAST--------------GTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGG 138 G+ G ST G G G GA G GV G G G G G G Sbjct 1187 KGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPG 1246 Query 139 TGGAGGLFASGGAGGAGGAATTGTGGAGGAGGKAGL-----LFGSGGAGGAGGSSGIGGF 193 G GL G G G G GA G GK G+ L G G G G+ GI G Sbjct 1247 DQGEQGL--KGERGSEGNK---GKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISG- 1300 Query 194 AAGGAGGPGGAGGLFNGGGAGGAGGSGVSGGAGGEGGAGGSGGGGSVAGDGGAGGNAGLL 253 G GP G GL G GG G +G++G G G G SG GS G G G GL Sbjct 1301 -NPGKIGPPGKQGL--PGIRGGPGRTGLAGAPGPPGVKGSSGLPGS-PGIQGPKGEQGLP 1356 Query 254 -APGLAGGAGGGGGQGFDTGGAGGPGGDAGL--LVGSGGVGGAGGFGLTTGGPGAAG--G 308 PG+ G G G QG GP GD GL G GV G GF G PG G G Sbjct 1357 GQPGIQGKRGHRGAQG-----DQGPCGDPGLKGQPGEYGVQGLTGFQ---GFPGPKGPEG 1408 Query 309 DAGLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGG 356 DAG++ SG G G G T G G+ G+IG G G G G Sbjct 1409 DAGIVGISGPKGPIGHRGNT------GPLGREGIIGPTGRTGPRGEKG 1450 > XP_016856417.1 collagen alpha-1(XXIV) chain isoform X8 [Homo sapiens] Length=972 Score = 39.3 bits (90), Expect = 0.031, Method: Compositional matrix adjust. Identities = 136/412 (33%), Positives = 152/412 (37%), Gaps = 93/412 (23%) Query 4 NGGAGGSGAPGAIGGAG--------GPAGLIGVGGAGGAGGDSAVAGVIGGAGGAGGAAL 55 +G G G PG IG G GP G++G+ G G G G IG G G Sbjct 334 DGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSR-- 391 Query 56 LFGAGGAGGAGGSGGSGAAG-----------------------GAGGAGGAGGLFASGGS 92 G G G SG GA G G G G GL G Sbjct 392 ----GPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGE 447 Query 93 GGFGGFAST--------------GTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGG 138 G+ G ST G G G GA G GV G G G G G G Sbjct 448 KGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPG 507 Query 139 TGGAGGLFASGGAGGAGGAATTGTGGAGGAGGKAGL-----LFGSGGAGGAGGSSGIGGF 193 G GL G G G GA G GK G+ L G G G G+ GI G Sbjct 508 DQGEQGL-----KGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISG- 561 Query 194 AAGGAGGPGGAGGLFNGGGAGGAGGSGVSG-----GAGGEGGAGGSGGGGSVAGDGGAGG 248 G GP G GL G GG G +G++G G G G GS G + G G G Sbjct 562 -NPGKIGPPGKQGL--PGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPG---IQGPKGEQG 615 Query 249 NAGLLAPGLAGGAGGGGGQGFDTGGAGGPGGDAGL--LVGSGGVGGAGGFGLTTGGPGAA 306 G PG+ G G G Q G GP GD GL G GV G GF G PG Sbjct 616 LPG--QPGIQGKRGHRGAQ-----GDQGPCGDPGLKGQPGEYGVQGLTGF---QGFPGPK 665 Query 307 G--GDAGLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGG 356 G GDAG++ SG G G G T G G+ G+IG G G G G Sbjct 666 GPEGDAGIVGISGPKGPIGHRGNT------GPLGREGIIGPTGRTGPRGEKG 711 > NP_001336884.1 collagen alpha-1(XXIV) chain isoform 2 [Homo sapiens] Length=1014 Score = 38.9 bits (89), Expect = 0.040, Method: Compositional matrix adjust. Identities = 136/412 (33%), Positives = 152/412 (37%), Gaps = 93/412 (23%) Query 4 NGGAGGSGAPGAIGGAG--------GPAGLIGVGGAGGAGGDSAVAGVIGGAGGAGGAAL 55 +G G G PG IG G GP G++G+ G G G G IG G G Sbjct 376 DGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSR-- 433 Query 56 LFGAGGAGGAGGSGGSGAAG-----------------------GAGGAGGAGGLFASGGS 92 G G G SG GA G G G G GL G Sbjct 434 ----GPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGE 489 Query 93 GGFGGFAST--------------GTGGAGGTGGAGGLFASGGVGGTGGGAGSGGTGGVGG 138 G+ G ST G G G GA G GV G G G G G G Sbjct 490 KGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPG 549 Query 139 TGGAGGLFASGGAGGAGGAATTGTGGAGGAGGKAGL-----LFGSGGAGGAGGSSGIGGF 193 G GL G G G GA G GK G+ L G G G G+ GI G Sbjct 550 DQGEQGL-----KGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISG- 603 Query 194 AAGGAGGPGGAGGLFNGGGAGGAGGSGVSG-----GAGGEGGAGGSGGGGSVAGDGGAGG 248 G GP G GL G GG G +G++G G G G GS G + G G G Sbjct 604 -NPGKIGPPGKQGL--PGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPG---IQGPKGEQG 657 Query 249 NAGLLAPGLAGGAGGGGGQGFDTGGAGGPGGDAGL--LVGSGGVGGAGGFGLTTGGPGAA 306 G PG+ G G G Q G GP GD GL G GV G GF G PG Sbjct 658 LPG--QPGIQGKRGHRGAQ-----GDQGPCGDPGLKGQPGEYGVQGLTGF---QGFPGPK 707 Query 307 G--GDAGLLFGSGGAGGAGGSGRTDLGGAGGAGGKAGLIGNGGNGGAGGAGG 356 G GDAG++ SG G G G T G G+ G+IG G G G G Sbjct 708 GPEGDAGIVGISGPKGPIGHRGNT------GPLGREGIIGPTGRTGPRGEKG 753 Lambda K H a alpha 0.307 0.146 0.449 0.792 4.96 Gapped Lambda K H a alpha sigma 0.267 0.0410 0.140 1.90 42.6 43.6 Effective search space used: 22892883804 Database: GCF_000001405.40_GRCh38.p14_protein.faa Posted date: Apr 24, 2023 10:59 AM Number of letters in database: 94,334,868 Number of sequences in database: 136,193 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Neighboring words threshold: 11 Window for multiple hits: 40