Streaming Video Understanding and Multi-round Interaction with Memory-enhanced Knowledge