BideDPO: Conditional Image Generation with Simultaneous Text and Condition Alignment