Seminar on Internet Technologies (Winter 2016/2017): Difference between revisions

No edit summary
Line 125: Line 125:
| [http://www.net.informatik.uni-goettingen.de/people/tao_zhao Tao Zhao]
| [http://www.net.informatik.uni-goettingen.de/people/tao_zhao Tao Zhao]
| Take a look at related papers in well known conferences/workshops/journals, e.g., [http://www.ccsb.dfci.harvard.edu/web/export/sites/default/ccsb/publications/papers/2010/Song--Barabasi_NatPhysics_10.pdf]  
| Take a look at related papers in well known conferences/workshops/journals, e.g., [http://www.ccsb.dfci.harvard.edu/web/export/sites/default/ccsb/publications/papers/2010/Song--Barabasi_NatPhysics_10.pdf]  
|-
|-
| '''Learning from Imbalanced Data''' 
When building and training classifiers for classification problems, one commonly encountered problem is that of imbalanced data. For instance, in the case of a binary classifier, this means that one class is hugely overrepresented in the data available. Training classifiers for this kind of datasets has been a problem for some time. In this work, your task is to i) precisely introduce the imbalanced data problem, ii) discuss the state of the art of approaches for mitigating this problem (both from the perspective of learning algorithms and data manipulation techniques) and iii) find out what issues still remain open until today. Note that this topic requires a background in data science, and in particular in classification algorithms. Also, this topic requires a comparatively high reading effort.
| [https://www.net.informatik.uni-goettingen.de/people/David_Koll David Koll ]
| [http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=5128907&tag=1]
|-|}
|-|}