* works on interlm and vicuna * support GQA * remove comment * update readme, add logger, default tp=1 * remove log