1,721,108 research outputs found
Deploying Differential Privacy for the 2020 Census of Population and Housing
The views in this presentation are those of the author, and not those of the U.S. Census Bureau
Lessons Learned Writing Computer Forensics Tools and Managing a 30TB Digital Evidence Corpus
DFRWS 2012, Aug. 6-8, 2012, Washington, DC.The article of record as published may be found at http://dx.doi.org/10.1016/j.diin.2012.05.002Refereed Conference PaperWriting digital forensics (DF) tools is difficult because of the diversity of data types that needs to be processed, the need for high performance, the skill set of most users, and the requirement that the software run without crashing. Developing this software is dramatically easier when one possesses a few hundred disks of other people's data for testing purposes. This paper presents some of the lessons learned by the author over the past 14 years developing DF tools and maintaining several research corpora that currently total roughly 30TB
A study of topic and topic change in conversational threads
This thesis applies Latent Dirichlet Allocation (LDA) to the problem of topic and topic change in conversational threads using e-mail. We demonstrate that LDA can be used to successfully classify raw e-mail messages with threads to which they belong, and compare the results with those for processed threads, where quoted and reply text have been removed. Raw thread classification performs better, but processed threads show promise. We then present two new, unsupervised techniques for identifying topic change in e-mail. The first is a keyword clustering approach using LDA and DBSCAN to identify clusters of topics, and transition points between them. The second is a sliding window technique which assesses the current topic for every window, identifying transition points. The keyword clustering performs better than the sliding window approach. Both can be used as a baseline for future work.Approved for public release; distribution is unlimited.NASA Ames Research Center author (civilian).http://archive.org/details/astudyoftopicndt10945457
Recommended from our members
The User is not the enemy
Many system security departments treat users as a security risk to be controlled. The general consensus is that most users are careless and unmotivated when it comes to system security. In a recent study, we found that users may indeed compromise computer security mechanisms, such as password authentication, both knowing and unknowingly. A closer analysis, however, revealed that such behavior is often caused by the way in which security mechanisms are implemented, and users’ lack of knowledge. We argue that to change this state of affairs, security departments need to communicate more with users, and adopt a user-centered design approach
Triage visualization for digital media exploitation
Digital forensic examiners are overwhelmed by case loads and data volumes and must prioritize their work. This thesis hypothesis that digital forensic examiners can employ triage visualizations to prioritize work loads. This thesis presents a simple one page visualization of disk activity for Windows FAT and NTFS filesystems. The visualization is constructed from filesystem meta data carved by the open source bulk_extractor digital forensics application. The visualization does not require further examination or reconstruction of file system metadata. The visualization is able to detect minor obfuscation or modification and overwriting of file system timestamps.Approved for public release; distribution is unlimited.Civilian, Vista Research Inc.http://archive.org/details/triagevisualizat109453763
An analysis of the applicability of federal law regarding hash-based searches of digital media
The Fourth Amendment of the United States (U.S.) Constitution limits the ability of the government to search U.S. persons without cause or justification. The application of the Fourth Amendment to digital forensics search techniques is still evolving. This thesis summarizes current federal law and recent judicial rulings that can apply Fourth Amendment doctrine to current digital forensics techniques. It uses three hypothetical scenarios to show how current law could be applied to new techniques now under development: the use of sector hashes to find traces of digital contraband; the use of random sampling to rapidly triage large digital media; and the use of similarity functions to find documents that are similar but not identical to target documents.Approved for public release; distribution is unlimited.Outstanding ThesisLieutenant Commander, United States Navyhttp://archive.org/details/annalysisofpplic109454271
An automated acquisition system for media exploitation
uses DBUS and the Hardware Abstraction Layer (HAL) to automatically detect device insertion and start forensic imaging.Approved for public release; distribution is unlimited.Federal Cyber Corps author (civilian).http://archive.org/details/anutomatedcquisi10945409
- …
