2010-02-04 104 views
3

如何優化此查詢?架構:複雜SQL優化與通用語言

mysql> show columns from transactionlog; 
+---------------+-------------------------------------------+------+-----+---------+----------------+ 
| Field   | Type          | Null | Key | Default | Extra   | 
+---------------+-------------------------------------------+------+-----+---------+----------------+ 
| id   | int(11)         | NO | PRI | NULL | auto_increment | 
| transactionid | varchar(10)        | NO | MUL | NULL |    | 
| queryid  | tinyint(4)        | NO |  | NULL |    | 
| tableid  | varchar(30)        | NO | MUL | NULL |    | 
| tupleid  | int(11)         | NO |  | NULL |    | 
| querytype  | enum('select','insert','delete','update') | NO |  | NULL |    | 
| schemaname | varchar(20)        | YES |  | NULL |    | 
| partition  | tinyint(3) unsigned      | YES |  | NULL |    | 
+---------------+-------------------------------------------+------+-----+---------+----------------+ 
8 rows in set (0.04 sec) 

查詢:

select concat(weight, ' ', ids, '\n') 
from (
    select 
    tableid, 
    tupleid, 
    group_concat(id separator ' ') as ids, 
    (
     select count(distinct transactionid) 
     from transactionlog 
     where transactionid in (
     select transactionid 
     from transactionlog 
     where (tableid, tupleid, querytype) = 
       (t.tableid, t.tupleid, 'update') 
     group by transactionid 
     having count(*) > 0 
    ) 
    ) weight 
    from transactionlog t 
    group by tableid, tupleid 
    having weight > 0 and count(*) > 1 
) u; 

這是輸出講解和MK-視覺解釋:

+----+--------------------+----------------+-------+---------------+---------------+---------+-----------+------+------------------------------ 
----------------+ 
| id | select_type  | table   | type | possible_keys | key   | key_len | ref  | rows | Extra          | 
+----+--------------------+----------------+-------+---------------+---------------+---------+-----------+------+----------------------------------------------+ 
| 1 | PRIMARY   | <derived2>  | ALL | NULL   | NULL   | NULL | NULL  | 13 |            | 
| 2 | DERIVED   | t    | ALL | NULL   | NULL   | NULL | NULL  | 68 | Using filesort        | 
| 3 | DEPENDENT SUBQUERY | transactionlog | index | NULL   | transactionid | 12  | NULL  | 68 | Using where; Using index      | 
| 4 | DEPENDENT SUBQUERY | transactionlog | ref | tableid  | tableid  | 36  | func,func | 2 | Using where; Using temporary; Using filesort | 
+----+--------------------+----------------+-------+---------------+---------------+---------+-----------+------+----------------------------------------------+ 
Table scan 
rows   13 
+- DERIVED 
    table   derived(t,transactionlog,temporary(transactionlog)) 
    +- DEPENDENT SUBQUERY 
     +- DEPENDENT SUBQUERY 
     | +- Filesort 
     | | +- TEMPORARY 
     | |  table   temporary(transactionlog) 
     | |  +- Filter with WHERE 
     | |  +- Bookmark lookup 
     | |   +- Table 
     | |   | table   transactionlog 
     | |   | possible_keys tableid 
     | |   +- Index lookup 
     | |    key   transactionlog->tableid 
     | |    possible_keys tableid 
     | |    key_len  36 
     | |    ref   func,func 
     | |    rows   2 
     | +- Filter with WHERE 
     |  +- Index scan 
     |  key   transactionlog->transactionid 
     |  key_len  12 
     |  rows   68 
     +- Filesort 
     +- Table scan 
      rows   68 
      +- Table 
       table   t 

這是一個很大的工作。

results = query(""" 
    select tableid, tupleid, transactionid, id, querytype 
    from transactionlog_2warehouse 
""") 
_tab, _tup = None 
ids = [] 
weight = 0 
saw_upd = False 
for tab, tup, txn, id, qt in results: 
    if (_tab, _tup) != (tab, tup): 
    if len(ids) > 1 and weight > 0: 
     print weight, ids 
    weight = 0 
    ids = [] 
    _txn = None 
    if _txn != txn: 
    saw_upd = False 
    if qt == 'update' and not saw_upd: 
    weight += 1 
    saw_upd = True 
    ids += [id] 

是否有可能實現使用純SQL Python的單通道性能:同時使單通,我可以用Python語言編寫的等效邏輯?提前致謝!

+0

將其更改爲PL/SQL,你可能有機會:P – glasnt 2010-02-04 05:52:17

+0

你能不能給我們一些信息,你需要提取什麼信息。聽起來像一個有趣的問題,但沒有要求,我不能花時間猜測/做偵探工作..謝謝。 – lexu 2010-02-04 05:55:24

回答

0

用途:

SELECT CONCAT(x.weight, ' ', GROUP_CONCAT(t.id SEPARATOR ' '), '\n') 
    FROM TRANSACTIONLOG t 
    JOIN (SELECT tl.tableid, 
       tl.tupleid, 
       COUNT(DISTINCT tl.transactionid) AS weight 
      FROM TRANSACTIONLOG tl 
      WHERE tl.querytype = 'update' 
     GROUP BY tl.tableid, tl.tupleid) x ON x.tableid = t.tableid 
              AND x.tupleid = t.tupleid 
              AND x.weight > 0 
GROUP BY t.tableid, t.tupleid, x.weight 
    HAVING COUNT(*) > 1