We are excited to announce that support for UTF-8 and Japanese collations in Azure Synapse Dedicated SQL pools is now generally available!
What is UTF-8?
UTF-8 lets you store multilingual characters in the CHAR and VARCHAR data types. With UTF-16, storing those characters required the NCHAR and NVARCHAR data types. This means your schema definition only needs CHAR and VARCHAR, even if you store multilingual characters!
Learning more about UTF-8
When your data mixes Latin alphabet characters with other multilingual characters, UTF-8 can save space and improve performance. This is because UTF-8 stores each basic Latin (ASCII) character in a single byte, whereas UTF-16 uses two bytes for the same character.
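To make the space difference concrete, here is a minimal sketch that compares the encoded size of a few strings in UTF-8 and UTF-16. It runs in plain Python rather than in a SQL pool, the helper name `encoded_sizes` and the sample strings are hypothetical, and it only illustrates the encodings themselves; actual on-disk size in a dedicated SQL pool also depends on factors such as column definitions and compression.

```python
def encoded_sizes(text):
    """Return (utf8_bytes, utf16_bytes) for the given string.

    Hypothetical helper for illustration only. UTF-16 is encoded as
    little-endian without a byte order mark so that only the
    characters themselves are counted.
    """
    utf8_bytes = len(text.encode("utf-8"))
    utf16_bytes = len(text.encode("utf-16-le"))
    return utf8_bytes, utf16_bytes


# Illustrative samples: pure Latin, mostly Latin with some Japanese,
# and pure Japanese.
samples = ["Hello", "Order 42 shipped to 東京", "こんにちは"]

for sample in samples:
    utf8_bytes, utf16_bytes = encoded_sizes(sample)
    print(f"{sample!r}: UTF-8 = {utf8_bytes} bytes, UTF-16 = {utf16_bytes} bytes")
```

In this sketch the mostly Latin string is noticeably smaller in UTF-8, while the all-Japanese string is larger in UTF-8 than in UTF-16, so the savings depend on how much of your data falls in the Latin range.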