You've got determined that the ideal method of clear up your trouble is to work with a CNN combined with a bounding box detector, that even further procedures image crops after which makes use of an LSTM to combine almost everything. It requires ten minutes only for your GPU to initialize your design.You don’t know what to do which has a barrier