开发者

Query optimization - VARCHAR equality vs Numeric equality

I have a large SQL Server database with about 40 columns and hundreds of millions of rows.

This table is supposed to be loose in schema so I have a lot of columns as VARCHAR(MAX) even where it could have been BIGINT, DATETIME, INT etc. Does this have an impact on querying time/effici开发者_高级运维ency? e.g. will

SELECT TOP 100 * FROM CustomerId = 34343

be faster than

SELECT TOP 100 * FROM CustomerId = '34343'

? If yes, how much faster?

And what if I use VARCHAR(MAX) instead of fixed length VARCHAR.. And what about other DBs like mySQL etc. in this regard?


Yes, comparing strings is usually slower than comparing pure numbers. Whether it is measurable depends on how the query execution engine does the comparison. If the query engine does not compare to the end of the strings - which it often won't, then your penalty is not great. Try it and see. But in theory, you'd be better off with the numeric comparison for numeric quantities.


Yes, there's definitely a performance benefit from using INT vs. VARCHAR(MAX) for comparisons. How much is really hard to say without actually measuring.

Also - there's no reason not to use VARCHAR(MAX) - but only when it's needed and when it makes sense!

See:

  • Reasons for NOT using Varchar (MAX)
  • this other Stackoverflow question

for some good reasons why you shouldn't just make everything VARCHAR(MAX) - just because you could....


And what if I use VARCHAR(MAX) instead of fixed length VARCHAR.. And what about other DBs like mySQL etc. in this regard?

  • PostgreSQL treats VARCHAR(n) as if it were TEXT CHECK(LENGTH(Column) <= n). There's no performance advantage to specifying a maximum length.
  • SQLite completely ignores length limits on VARCHAR columns.
  • MS SQL Server, however, does not allow creating indexes on VARCHAR(MAX) columns, which decreases performance.
0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜