You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: classes/dsci550_2025a/index.html
+11-11Lines changed: 11 additions & 11 deletions
Original file line number
Diff line number
Diff line change
@@ -335,7 +335,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
335
335
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2045. Multipurpose internet mail extensions (MIME) part one: Format of internet message bodies. 1996.</li>
336
336
<li>Freed, Ned, and Nathaniel Borenstein. RFC 2046 Multipurpose internet mail extensions (MIME) part two: Media types, November, 1996.</li>
337
337
<li>Freed, Ned. RFC 2048 "Multipurpose internet mail extensions (MIME) part four: Registration procedures." ISI (1996).</li>
338
-
<li>Hicks, Ben J., et al. "Organizing and managing personal electronic files: A mechanical engineer's perspective." ACM Transactions on Information Systems (TOIS) 26.4 (2008): 23.<strong>(Presented by: Mehrnegar Aminy)</strong></li>
338
+
<li>Hicks, Ben J., et al. "Organizing and managing personal electronic files: A mechanical engineer's perspective." ACM Transactions on Information Systems (TOIS) 26.4 (2008): 23.</li>
339
339
<li>Jackson, Andrew N. "Formats over time: Exploring UK web history." arXiv preprint arXiv:1210.1714 (2012). <strong>(Presented by: Batuhan Aydin)</strong></li>
340
340
</ul></td>
341
341
<td> </td>
@@ -350,7 +350,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
350
350
<li>Individual Presentations - Week 2 Papers</li>
351
351
</ul></td>
352
352
<td><ulclass="text-left"><li>Tika in Action, Chapter 3</li>
353
-
<li>Shim, Jungwon Roy. "Arium: Beyond the Desktop Metaphor: A new way of navigating, searching, and organizing personal digital data." Masters Thesus, Carnegie Mellon University (2012).<strong>(Presented by: Sean Iredell)</strong></li>
353
+
<li>Shim, Jungwon Roy. "Arium: Beyond the Desktop Metaphor: A new way of navigating, searching, and organizing personal digital data." Masters Thesus, Carnegie Mellon University (2012).</li>
354
354
<li>Crowder, Jerome, Jonathan Marion, and Michele Reilly. "File Naming in Digital Media Research: Examples from the Humanities and Social Sciences." Journal of Librarianship and Scholarly Communication 3.3 (2015). <strong>(Presented by: Eleanor Bi)</strong></li>
355
355
<li>Bik, Elisabeth M., Casadevall, Arturo, Fang, Ferrie C. The Prevalence of Inappropriate Image Duplication in Biomedical Research Publications.</li>
356
356
<li>Manku, Gurmeet Singh, Arvind Jain, and Anish Das Sarma. "Detecting near-duplicates for web crawling." Proceedings of the 16th international conference on World Wide Web. ACM, 2007.</li>
@@ -383,7 +383,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
383
383
<li>Ahmed, Irfan, et al. "Fast file-type identification." Proceedings of the 2010 ACM Symposium on Applied Computing. ACM, 2010. </li>
384
384
<li>Pierris, Georgios, and Stilianos Vidalis. "Forensically classifying files using HSOM algorithms." Emerging Intelligent Data and Web Technologies (EIDWT), 2012 Third International Conference on. IEEE, 2012.</li>
385
385
<li>Harris, Ryan M. "Using artificial neural networks for forensic file type identification." Master's Thesis, Purdue University (2007). <strong>(Presented by: Haoran Wang)</strong></li>
386
-
<li>Douceur, John R., and William J. Bolosky. A large-scale study of file-system contents. ACM SIGMETRICS Performance Evaluation Review 27.1 (1999): 59-70.</li>
386
+
<li>Douceur, John R., and William J. Bolosky. A large-scale study of file-system contents. ACM SIGMETRICS Performance Evaluation Review 27.1 (1999): 59-70.<strong>(Presented by: Taylor MacDonell)</strong></li>
387
387
</ul>
388
388
</td>
389
389
<td>Resources: <br/><br/>
@@ -405,7 +405,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
405
405
<li>Kilicoglu, Halil, et al. "Semantic MEDLINE: a web application for managing the results of PubMed Searches." Proceedings of the third international symposium for semantic mining in biomedicine. Vol. 2008. <strong>(Presented by: Aadarsh Sudhir Ghiya)</strong></li>
406
406
<li>Kobayashi, Mei, and Koichi Takeda. "Information retrieval on the web." ACM Computing Surveys (CSUR) 32.2 (2000): 144-173.<strong>(Presented by: Salome Otero Gutierrez)</strong></li>
407
407
<li>Voorhees, Ellen M., and Donna Harman. "Overview of the sixth text retrieval conference (TREC-6)." Information Processing & Management 36.1 (2000): 3-35.<strong> </strong></li>
408
-
<li>Arasu, Arvind, and Hector Garcia-Molina. Extracting structured data from web pages. Proceedings of the 2003 ACM SIGMOD international conference on Management of data. ACM, 2003.</li>
408
+
<li>Arasu, Arvind, and Hector Garcia-Molina. Extracting structured data from web pages. Proceedings of the 2003 ACM SIGMOD international conference on Management of data. ACM, 2003.<strong>(Presented by: Yihan Xia)</strong></li>
409
409
<li>Lewandowski, Dirk. "Web searching, search engines and Information Retrieval." Information Services & Use 25.3, 4 (2005): 137-147. <strong>(Presented by: Donggyu Kim)</strong></li>
410
410
<li>Weninger, Tim, William H. Hsu, and Jiawei Han. "CETR: content extraction via tag ratios." Proceedings of the 19th international conference on World wide web. ACM, 2010. </li>
411
411
<li>Karpathy, Andrej, and Li Fei-Fei. "Deep visual-semantic alignments for generating image descriptions." Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 2015.<strong>(Presented by: Yung Yee Chia)</strong></li>
@@ -431,7 +431,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
431
431
<li>Gowda, Thamme, and Chris A. Mattmann. "Clustering Web Pages Based on Structure and Style Similarity (Application Paper)." Information Reuse and Integration (IRI), 2016 IEEE 17th International Conference on. IEEE, 2016. <strong>(Presented by: Angel Su)</strong></li>
432
432
<li> Anquetil, Nicolas, and Timothy Lethbridge. File clustering using naming conventions for legacy systems. Proceedings of the 1997 conference of the Centre for Advanced Studies on Collaborative research. IBM Press, 1997.</li>
433
433
<li>Swierk, Edward, et al. "The Roma personal metadata service." Mobile Networks and Applications 7.5 (2002): 407-418.</li>
434
-
<li>Karypis, Michael Steinbach George, Vipin Kumar, and Michael Steinbach. "A comparison of document clustering techniques." KDD workshop on Text Mining. 2000.</li>
434
+
<li>Karypis, Michael Steinbach George, Vipin Kumar, and Michael Steinbach. "A comparison of document clustering techniques." KDD workshop on Text Mining. 2000.<strong>(Presented by: Janak Tarun Thakkar)</strong></li>
435
435
<li>Marchionini, Gary. "Exploratory search: from finding to understanding." Communications of the ACM 49.4 (2006): 41-46.</li>
436
436
</ul>
437
437
</td>
@@ -458,7 +458,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
458
458
<li>Koehn, Philipp, et al. "Moses: Open source toolkit for statistical machine translation." Proceedings of the 45th annual meeting of the ACL on interactive poster and demonstration sessions. Association for Computational Linguistics, 2007. </li>
459
459
<li>Post, Matt, et al. "Joshua 5.0: Sparser, better, faster, server." Proceedings of the Eighth Workshop on Statistical Machine Translation. 2013.</li>
460
460
<li>Lins, Rafael Dueire, and Paulo Gonçalves. Automatic language identification of written texts. Proceedings of the 2004 ACM symposium on Applied computing. ACM, 2004. <strong>(Presented by: Kylan Parayao)</strong></li>
461
-
<li>Papineni, Kishore, et al. "BLEU: a method for automatic evaluation of machine translation." Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 2002.<strong>(Presented by: Yihan Xia)</strong></li>
461
+
<li>Papineni, Kishore, et al. "BLEU: a method for automatic evaluation of machine translation." Proceedings of the 40th annual meeting on association for computational linguistics. Association for Computational Linguistics, 2002.</li>
462
462
<li>Bahdanau, Dzmitry, Kyunghyun Cho, and Yoshua Bengio. "Neural machine translation by jointly learning to align and translate." arXiv preprint arXiv:1409.0473 (2014).</li>
463
463
<li>Tromp, Erik, and Mykola Pechenizkiy. "Graph-based n-gram language identification on short texts." Proc. 20th Machine Learning conference of Belgium and The Netherlands. 2011.</li>
464
464
<li>Lopez-Moreno, Ignacio, et al. "Automatic language identification using deep neural networks." Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014. </li>
@@ -540,7 +540,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
540
540
<li>Elsayed, Tamer, Jimmy Lin, and Douglas W. Oard. "Pairwise document similarity in large collections with MapReduce." Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers. Association for Computational Linguistics, 2008. <strong>(Presented by: Aaron Kuo)</strong></li>
541
541
<li>M. Bernaschi, M. Cianfriglia, A. Di Marco, A. Sabellico, G. Me, G. Carbone, G. Totaro. Forensic Disk Image Indexing and Search in an HPC environment. IEEE International Conference on High Performance Computing & Simulation (HPCS), 2014.</li>
542
542
<li>Meusel, Robert, Peter Mika, and Roi Blanco. "Focused crawling for structured data." Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management. ACM, 2014. <strong>(Presented by: Caroline Ghanbary)</strong></li>
543
-
<li>Niu, Feng, et al. "DeepDive: Web-scale Knowledge-base Construction using Statistical Learning and Inference." VLDS 12 (2012): 25-28.</li>
543
+
<li>Niu, Feng, et al. "DeepDive: Web-scale Knowledge-base Construction using Statistical Learning and Inference." VLDS 12 (2012): 25-28.<strong>(Presented by: Shiyi Li)</strong></li>
544
544
<li>Mattmann, C. A., Oh, J. H., Palsulich, T., McGibbney, L. J., Gil, Y., & Ratnakar, V. (2015, November). DRAT: An Unobtrusive, Scalable Approach to Large Scale Software License Analysis. In Automated Software Engineering Workshop (ASEW), 2015 30th IEEE/ACM International Conference on (pp. 97-101). IEEE. </li>
545
545
</ul>
546
546
</td>
@@ -592,9 +592,9 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
592
592
<li>Individual Presentations </li>
593
593
</ul></td>
594
594
<td><ulclass="text-left"><li>Tika in Action, Chapter 11</li>
595
-
<li>Nowell, Lucy Terry, et al. "Visualizing search results: some alternatives to query-document similarity." Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 1996.<strong>(Presented by: Christelle Bou Nehme Sawaya)</strong></li>
595
+
<li>Nowell, Lucy Terry, et al. "Visualizing search results: some alternatives to query-document similarity." Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval. ACM, 1996.</li>
596
596
<li>Shneiderman, Ben. "The eyes have it: A task by data type taxonomy for information visualizations." Visual Languages, 1996. Proceedings., IEEE Symposium on. IEEE, 1996. <strong>(Presented by: Tianxing Chen)</strong></li>
597
-
<li>Gottron, Thomas. "Evaluating content extraction on HTML documents." Proceedings of the 2nd International Conference on Internet Technologies and Applications (ITA’07). 2007. <strong>(Presented by: Aanchal Dinesh Pandey)</strong></li>
597
+
<li>Gottron, Thomas. "Evaluating content extraction on HTML documents." Proceedings of the 2nd International Conference on Internet Technologies and Applications (ITA’07). 2007. </li>
598
598
<li>Leuski, Anton. "Evaluating document clustering for interactive information retrieval." Proceedings of the tenth international conference on Information and knowledge management. ACM, 2001.</li>
599
599
<li>Bailey, Peter, et al. "Evaluating search systems using result page context." Proceedings of the third symposium on Information interaction in context. ACM, 2010.</li>
600
600
</ul>
@@ -617,7 +617,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
617
617
</td>
618
618
<td><ulclass="text-left">
619
619
<li>Palamuttam, Rahul, et al. "SciSpark: Applying in-memory distributed computing to weather event detection and tracking." Big Data (Big Data), 2015 IEEE International Conference on. IEEE, 2015. <strong>(Presented by: Yumeng Zhang)</strong></li>
620
-
<li>Leavitt, Neal. "Will NoSQL databases live up to their promise?." Computer 43.2 (2010). <strong>(Presented by: Aidot Sairambay)</strong></li>
620
+
<li>Leavitt, Neal. "Will NoSQL databases live up to their promise?." Computer 43.2 (2010). <strong>(Presented by: Aidos Sairambay)</strong></li>
621
621
<li>Stonebraker, Michael. "SQL databases v. NoSQL databases." Communications of the ACM 53.4 (2010): 10-11. <strong>(Presented by: Megan Rajan)</strong></li>
622
622
<li>Stonebraker, Michael. "Stonebraker on NoSQL and enterprises." Communications of the ACM 54.8 (2011): 10-11.</li>
623
623
<li>Rafique, Ansar, et al. "On the performance impact of data access middleware for nosql data stores." IEEE Transactions on Cloud Computing (2015).</li>
@@ -645,7 +645,7 @@ <h2 class="section-heading">Schedule <font class="usc-color">(subject to change;
645
645
<td><ulclass="text-left"><li>Tika in Action, Chapter 12 - 14</li>
646
646
<li>C. Mattmann, D. Freeborn, D. Crichton, B. Foster, A. Hart, D. Woollard, S. Hardman, P. Ramirez, S. Kelly, A. Y. Chang, C. E. Miller. A Reusable Process Control System Framework for the Orbiting Carbon Observatory and NPP Sounder PEATE missions. In Proceedings of the 3rd IEEE Intl Conference on Space Mission Challenges for Information Technology (SMC-IT 2009), pp. 165-172, July 19 - 23, 2009.
647
647
</li>
648
-
<li>Wilkinson, Mark D., et al. "The FAIR Guiding Principles for scientific data management and stewardship." Scientific data 3 (2016): 160018. <strong>(Presented by: Yafei Wang)</strong></li>
648
+
<li>Wilkinson, Mark D., et al. "The FAIR Guiding Principles for scientific data management and stewardship." Scientific data 3 (2016): 160018. </li>
649
649
<li>Buneman, Peter, et al. "Archiving scientific data." ACM Transactions on Database Systems (TODS) 29.1 (2004): 2-42.</li>
650
650
<li>Fox, Peter, and James Hendler. "Changing the equation on scientific data visualization." Science 331.6018 (2011): 705-708.<strong>(Presented by: Liang Qian)</strong></li>
651
651
<li>Plale, Beth, et al. "Active management of scientific data." IEEE Internet Computing 9.1 (2005): 27-34.</li>
0 commit comments