Update Evaluation contents

#1

Add eval scripts and modify xwinograd metric scores

beomi changed pull request status to merged

Sign up or log in to comment